Dataset statistics
| Number of variables | 58 |
|---|---|
| Number of observations | 1000 |
| Missing cells | 14114 |
| Missing cells (%) | 24.3% |
| Total size in memory | 3.8 MiB |
| Average record size in memory | 3.8 KiB |
Variable types
| Numeric | 1 |
|---|---|
| Text | 53 |
| Unsupported | 3 |
| URL | 1 |
vendor_dba has 873 (87.3%) missing values | Missing |
email has 1000 (100.0%) missing values | Missing |
cert_renewal_date has 270 (27.0%) missing values | Missing |
address2 has 618 (61.8%) missing values | Missing |
mailingaddress2 has 607 (60.7%) missing values | Missing |
website has 283 (28.3%) missing values | Missing |
date_of_establishment has 84 (8.4%) missing values | Missing |
aggregate_bonding_limit has 890 (89.0%) missing values | Missing |
signatory_to_union_contracts has 931 (93.1%) missing values | Missing |
types_of_construction_projects_performed has 1000 (100.0%) missing values | Missing |
name_of_client_job_exp_1 has 42 (4.2%) missing values | Missing |
largest_value_of_contract has 52 (5.2%) missing values | Missing |
percent_self_performed_job_exp_1 has 97 (9.7%) missing values | Missing |
date_of_work_job_exp_1 has 42 (4.2%) missing values | Missing |
description_of_work_job_exp_1 has 43 (4.3%) missing values | Missing |
name_of_client_job_exp_2 has 211 (21.1%) missing values | Missing |
value_of_contract_job_exp_2 has 227 (22.7%) missing values | Missing |
percent_self_performed_job_exp_2 has 262 (26.2%) missing values | Missing |
date_of_work_job_exp_2 has 211 (21.1%) missing values | Missing |
description_of_work_job_exp_2 has 211 (21.1%) missing values | Missing |
name_of_client_job_exp_3 has 328 (32.8%) missing values | Missing |
value_of_contract_job_exp_3 has 344 (34.4%) missing values | Missing |
percent_self_performed_job_exp_3 has 387 (38.7%) missing values | Missing |
date_of_work_job_exp_3 has 328 (32.8%) missing values | Missing |
description_of_work_job_exp_3 has 328 (32.8%) missing values | Missing |
capacity_building_programs has 1000 (100.0%) missing values | Missing |
borough has 380 (38.0%) missing values | Missing |
latitude has 380 (38.0%) missing values | Missing |
longitude has 380 (38.0%) missing values | Missing |
community_board has 380 (38.0%) missing values | Missing |
council_district has 380 (38.0%) missing values | Missing |
bin has 387 (38.7%) missing values | Missing |
bbl has 387 (38.7%) missing values | Missing |
census_tract_2020_ has 380 (38.0%) missing values | Missing |
neighborhood_tabulation_area_nta_2020_ has 380 (38.0%) missing values | Missing |
0 has unique values | Unique |
account_number has unique values | Unique |
vendor_formal_name has unique values | Unique |
email is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
types_of_construction_projects_performed is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
capacity_building_programs is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
| Analysis started | 2023-12-09 23:12:18.451559 |
|---|---|
| Analysis finished | 2023-12-09 23:12:21.449168 |
| Duration | 3 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
0
Real number (ℝ)
UNIQUE 
| Distinct | 1000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 500.5 |
| Minimum | 1 |
|---|---|
| Maximum | 1000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 50.95 |
| Q1 | 250.75 |
| median | 500.5 |
| Q3 | 750.25 |
| 95-th percentile | 950.05 |
| Maximum | 1000 |
| Range | 999 |
| Interquartile range (IQR) | 499.5 |
Descriptive statistics
| Standard deviation | 288.8194361 |
|---|---|
| Coefficient of variation (CV) | 0.5770618104 |
| Kurtosis | -1.2 |
| Mean | 500.5 |
| Median Absolute Deviation (MAD) | 250 |
| Skewness | 0 |
| Sum | 500500 |
| Variance | 83416.66667 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 1 | 1 | 0.1% |
| 672 | 1 | 0.1% |
| 659 | 1 | 0.1% |
| 660 | 1 | 0.1% |
| 661 | 1 | 0.1% |
| 662 | 1 | 0.1% |
| 663 | 1 | 0.1% |
| 664 | 1 | 0.1% |
| 665 | 1 | 0.1% |
| 666 | 1 | 0.1% |
| Other values (990) | 990 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 |
| Value | Count | Frequency (%) |
| 1000 | 1 | |
| 999 | 1 | |
| 998 | 1 | |
| 997 | 1 | |
| 996 | 1 |
account_number
Text
UNIQUE 
| Distinct | 1000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 60.9 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 5.258 |
| Min length | 2 |
Characters and Unicode
| Total characters | 5258 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1000 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 311147 |
|---|---|
| 2nd row | 357418 |
| 3rd row | 331157 |
| 4th row | 10003 |
| 5th row | 342721 |
| Value | Count | Frequency (%) |
| 10660 | 1 | 0.1% |
| 102062 | 1 | 0.1% |
| 326940 | 1 | 0.1% |
| 10539 | 1 | 0.1% |
| 330490 | 1 | 0.1% |
| 7109 | 1 | 0.1% |
| 10185 | 1 | 0.1% |
| 10675 | 1 | 0.1% |
| 287182 | 1 | 0.1% |
| 309818 | 1 | 0.1% |
| Other values (990) | 990 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 939 | |
| 0 | 870 | |
| 3 | 770 | |
| 2 | 467 | |
| 6 | 443 | |
| 4 | 425 | |
| 5 | 400 | |
| 7 | 325 | 6.2% |
| 9 | 313 | 6.0% |
| 8 | 306 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5258 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 939 | |
| 0 | 870 | |
| 3 | 770 | |
| 2 | 467 | |
| 6 | 443 | |
| 4 | 425 | |
| 5 | 400 | |
| 7 | 325 | 6.2% |
| 9 | 313 | 6.0% |
| 8 | 306 | 5.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5258 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 939 | |
| 0 | 870 | |
| 3 | 770 | |
| 2 | 467 | |
| 6 | 443 | |
| 4 | 425 | |
| 5 | 400 | |
| 7 | 325 | 6.2% |
| 9 | 313 | 6.0% |
| 8 | 306 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5258 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 939 | |
| 0 | 870 | |
| 3 | 770 | |
| 2 | 467 | |
| 6 | 443 | |
| 4 | 425 | |
| 5 | 400 | |
| 7 | 325 | 6.2% |
| 9 | 313 | 6.0% |
| 8 | 306 | 5.8% |
UNIQUE 
| Distinct | 1000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 79.7 KiB |
Length
| Max length | 63 |
|---|---|
| Median length | 43 |
| Mean length | 24.447 |
| Min length | 6 |
Characters and Unicode
| Total characters | 24447 |
|---|---|
| Distinct characters | 76 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1000 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | #1Pho Inc |
|---|---|
| 2nd row | #NAME? |
| 3rd row | 024 Inc |
| 4th row | 1 Call Building Maintenance Corp. |
| 5th row | 1 Of A Kind Home Health Care L.L.C. |
| Value | Count | Frequency (%) |
| inc | 385 | 10.4% |
| llc | 294 | 7.9% |
| corp | 109 | 2.9% |
| 86 | 2.3% | |
| construction | 71 | 1.9% |
| services | 67 | 1.8% |
| a | 64 | 1.7% |
| consulting | 50 | 1.3% |
| group | 45 | 1.2% |
| all | 36 | 1.0% |
| Other values (1405) | 2503 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2712 | 11.1% | |
| n | 1746 | 7.1% |
| e | 1483 | 6.1% |
| i | 1276 | 5.2% |
| r | 1264 | 5.2% |
| o | 1142 | 4.7% |
| t | 1126 | 4.6% |
| A | 1115 | 4.6% |
| c | 1102 | 4.5% |
| a | 1070 | 4.4% |
| Other values (66) | 10411 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 14853 | |
| Uppercase Letter | 5581 | 22.8% |
| Space Separator | 2712 | 11.1% |
| Other Punctuation | 995 | 4.1% |
| Decimal Number | 262 | 1.1% |
| Dash Punctuation | 36 | 0.1% |
| Math Symbol | 6 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 1746 | |
| e | 1483 | |
| i | 1276 | |
| r | 1264 | |
| o | 1142 | 7.7% |
| t | 1126 | 7.6% |
| c | 1102 | 7.4% |
| a | 1070 | 7.2% |
| s | 891 | 6.0% |
| l | 810 | 5.5% |
| Other values (16) | 2943 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1115 | |
| C | 913 | |
| L | 764 | |
| I | 532 | |
| S | 374 | 6.7% |
| P | 227 | 4.1% |
| E | 218 | 3.9% |
| R | 171 | 3.1% |
| N | 170 | 3.0% |
| T | 169 | 3.0% |
| Other values (16) | 928 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 57 | |
| 0 | 36 | |
| 4 | 33 | |
| 2 | 32 | |
| 3 | 29 | |
| 5 | 18 | 6.9% |
| 7 | 17 | 6.5% |
| 8 | 15 | 5.7% |
| 9 | 14 | 5.3% |
| 6 | 11 | 4.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 524 | |
| , | 352 | |
| & | 98 | 9.8% |
| ' | 14 | 1.4% |
| # | 2 | 0.2% |
| ! | 2 | 0.2% |
| / | 1 | 0.1% |
| ? | 1 | 0.1% |
| : | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 2712 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 36 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 6 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20434 | |
| Common | 4013 | 16.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 1746 | 8.5% |
| e | 1483 | 7.3% |
| i | 1276 | 6.2% |
| r | 1264 | 6.2% |
| o | 1142 | 5.6% |
| t | 1126 | 5.5% |
| A | 1115 | 5.5% |
| c | 1102 | 5.4% |
| a | 1070 | 5.2% |
| C | 913 | 4.5% |
| Other values (42) | 8197 |
Common
| Value | Count | Frequency (%) |
| 2712 | ||
| . | 524 | 13.1% |
| , | 352 | 8.8% |
| & | 98 | 2.4% |
| 1 | 57 | 1.4% |
| 0 | 36 | 0.9% |
| - | 36 | 0.9% |
| 4 | 33 | 0.8% |
| 2 | 32 | 0.8% |
| 3 | 29 | 0.7% |
| Other values (14) | 104 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24447 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2712 | 11.1% | |
| n | 1746 | 7.1% |
| e | 1483 | 6.1% |
| i | 1276 | 5.2% |
| r | 1264 | 5.2% |
| o | 1142 | 4.7% |
| t | 1126 | 4.6% |
| A | 1115 | 4.6% |
| c | 1102 | 4.5% |
| a | 1070 | 4.4% |
| Other values (66) | 10411 |
vendor_dba
Text
MISSING 
| Distinct | 126 |
|---|---|
| Distinct (%) | 99.2% |
| Missing | 873 |
| Missing (%) | 87.3% |
| Memory size | 36.8 KiB |
Length
| Max length | 45 |
|---|---|
| Median length | 29 |
| Mean length | 18.76377953 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2383 |
|---|---|
| Distinct characters | 60 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 125 ? |
|---|---|
| Unique (%) | 98.4% |
Sample
| 1st row | Zenyai |
|---|---|
| 2nd row | Kilduff Underground Engineering, Inc. |
| 3rd row | WBE NYC |
| 4th row | A La Fresca |
| 5th row | Aigner Chocolates |
| Value | Count | Frequency (%) |
| 9 | 2.5% | |
| group | 6 | 1.7% |
| a | 5 | 1.4% |
| solutions | 5 | 1.4% |
| security | 5 | 1.4% |
| new | 5 | 1.4% |
| services | 5 | 1.4% |
| associates | 4 | 1.1% |
| inc | 4 | 1.1% |
| york | 4 | 1.1% |
| Other values (270) | 306 |
Most occurring characters
| Value | Count | Frequency (%) |
| 231 | 9.7% | |
| e | 193 | 8.1% |
| r | 146 | 6.1% |
| n | 144 | 6.0% |
| o | 141 | 5.9% |
| t | 126 | 5.3% |
| a | 125 | 5.2% |
| i | 122 | 5.1% |
| s | 106 | 4.4% |
| A | 85 | 3.6% |
| Other values (50) | 964 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1622 | |
| Uppercase Letter | 498 | 20.9% |
| Space Separator | 231 | 9.7% |
| Other Punctuation | 26 | 1.1% |
| Decimal Number | 3 | 0.1% |
| Math Symbol | 2 | 0.1% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 193 | |
| r | 146 | 9.0% |
| n | 144 | 8.9% |
| o | 141 | 8.7% |
| t | 126 | 7.8% |
| a | 125 | 7.7% |
| i | 122 | 7.5% |
| s | 106 | 6.5% |
| l | 72 | 4.4% |
| c | 71 | 4.4% |
| Other values (16) | 376 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 85 | |
| S | 59 | |
| C | 43 | 8.6% |
| E | 33 | 6.6% |
| I | 28 | 5.6% |
| T | 27 | 5.4% |
| P | 23 | 4.6% |
| M | 22 | 4.4% |
| L | 20 | 4.0% |
| N | 20 | 4.0% |
| Other values (14) | 138 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 9 | |
| & | 8 | |
| , | 4 | |
| ' | 3 | 11.5% |
| / | 2 | 7.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 4 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 231 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2120 | |
| Common | 263 | 11.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 193 | 9.1% |
| r | 146 | 6.9% |
| n | 144 | 6.8% |
| o | 141 | 6.7% |
| t | 126 | 5.9% |
| a | 125 | 5.9% |
| i | 122 | 5.8% |
| s | 106 | 5.0% |
| A | 85 | 4.0% |
| l | 72 | 3.4% |
| Other values (40) | 860 |
Common
| Value | Count | Frequency (%) |
| 231 | ||
| . | 9 | 3.4% |
| & | 8 | 3.0% |
| , | 4 | 1.5% |
| ' | 3 | 1.1% |
| + | 2 | 0.8% |
| 2 | 2 | 0.8% |
| / | 2 | 0.8% |
| - | 1 | 0.4% |
| 4 | 1 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2383 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 231 | 9.7% | |
| e | 193 | 8.1% |
| r | 146 | 6.1% |
| n | 144 | 6.0% |
| o | 141 | 5.9% |
| t | 126 | 5.3% |
| a | 125 | 5.2% |
| i | 122 | 5.1% |
| s | 106 | 4.4% |
| A | 85 | 3.6% |
| Other values (50) | 964 |
first_name
Text
| Distinct | 749 |
|---|---|
| Distinct (%) | 74.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 61.8 KiB |
Length
| Max length | 18 |
|---|---|
| Median length | 14 |
| Mean length | 6.087 |
| Min length | 2 |
Characters and Unicode
| Total characters | 6087 |
|---|---|
| Distinct characters | 59 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 608 ? |
|---|---|
| Unique (%) | 60.8% |
Sample
| 1st row | Albert |
|---|---|
| 2nd row | Todd |
| 3rd row | Gena |
| 4th row | Lorris |
| 5th row | Andrea |
| Value | Count | Frequency (%) |
| anthony | 8 | 0.8% |
| ann | 7 | 0.7% |
| jennifer | 7 | 0.7% |
| muhammad | 7 | 0.7% |
| andrew | 7 | 0.7% |
| maria | 7 | 0.7% |
| amy | 6 | 0.6% |
| michael | 6 | 0.6% |
| anne | 6 | 0.6% |
| jose | 6 | 0.6% |
| Other values (735) | 955 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 815 | 13.4% |
| e | 559 | 9.2% |
| n | 543 | 8.9% |
| i | 446 | 7.3% |
| r | 386 | 6.3% |
| l | 318 | 5.2% |
| A | 263 | 4.3% |
| o | 251 | 4.1% |
| h | 204 | 3.4% |
| t | 179 | 2.9% |
| Other values (49) | 2123 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4864 | |
| Uppercase Letter | 1182 | 19.4% |
| Space Separator | 22 | 0.4% |
| Open Punctuation | 5 | 0.1% |
| Close Punctuation | 5 | 0.1% |
| Dash Punctuation | 4 | 0.1% |
| Other Punctuation | 4 | 0.1% |
| Modifier Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 815 | |
| e | 559 | |
| n | 543 | |
| i | 446 | |
| r | 386 | 7.9% |
| l | 318 | 6.5% |
| o | 251 | 5.2% |
| h | 204 | 4.2% |
| t | 179 | 3.7% |
| d | 177 | 3.6% |
| Other values (16) | 986 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 263 | |
| M | 110 | 9.3% |
| S | 91 | 7.7% |
| J | 89 | 7.5% |
| R | 64 | 5.4% |
| L | 55 | 4.7% |
| D | 54 | 4.6% |
| N | 49 | 4.1% |
| C | 48 | 4.1% |
| E | 43 | 3.6% |
| Other values (16) | 316 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 | |
| ' | 1 | 25.0% |
Space Separator
| Value | Count | Frequency (%) |
| 22 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6046 | |
| Common | 41 | 0.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 815 | 13.5% |
| e | 559 | 9.2% |
| n | 543 | 9.0% |
| i | 446 | 7.4% |
| r | 386 | 6.4% |
| l | 318 | 5.3% |
| A | 263 | 4.3% |
| o | 251 | 4.2% |
| h | 204 | 3.4% |
| t | 179 | 3.0% |
| Other values (42) | 2082 |
Common
| Value | Count | Frequency (%) |
| 22 | ||
| ( | 5 | 12.2% |
| ) | 5 | 12.2% |
| - | 4 | 9.8% |
| . | 3 | 7.3% |
| ' | 1 | 2.4% |
| ´ | 1 | 2.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6086 | |
| None | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 815 | 13.4% |
| e | 559 | 9.2% |
| n | 543 | 8.9% |
| i | 446 | 7.3% |
| r | 386 | 6.3% |
| l | 318 | 5.2% |
| A | 263 | 4.3% |
| o | 251 | 4.1% |
| h | 204 | 3.4% |
| t | 179 | 2.9% |
| Other values (48) | 2122 |
None
| Value | Count | Frequency (%) |
| ´ | 1 |
last_name
Text
| Distinct | 884 |
|---|---|
| Distinct (%) | 88.5% |
| Missing | 1 |
| Missing (%) | 0.1% |
| Memory size | 62.3 KiB |
Length
| Max length | 36 |
|---|---|
| Median length | 19 |
| Mean length | 6.726726727 |
| Min length | 2 |
Characters and Unicode
| Total characters | 6720 |
|---|---|
| Distinct characters | 57 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 819 ? |
|---|---|
| Unique (%) | 82.0% |
Sample
| 1st row | Jethanamest |
|---|---|
| 2nd row | Kilduff |
| 3rd row | Surphlis |
| 4th row | Alleyne |
| 5th row | David |
| Value | Count | Frequency (%) |
| singh | 15 | 1.5% |
| williams | 9 | 0.9% |
| khan | 5 | 0.5% |
| smith | 5 | 0.5% |
| patel | 5 | 0.5% |
| gonzalez | 5 | 0.5% |
| torres | 5 | 0.5% |
| de | 5 | 0.5% |
| perez | 4 | 0.4% |
| jr | 4 | 0.4% |
| Other values (880) | 968 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 731 | 10.9% |
| e | 552 | 8.2% |
| r | 468 | 7.0% |
| i | 462 | 6.9% |
| n | 458 | 6.8% |
| o | 442 | 6.6% |
| l | 346 | 5.1% |
| s | 274 | 4.1% |
| t | 223 | 3.3% |
| h | 213 | 3.2% |
| Other values (47) | 2551 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5369 | |
| Uppercase Letter | 1268 | 18.9% |
| Space Separator | 32 | 0.5% |
| Dash Punctuation | 29 | 0.4% |
| Other Punctuation | 22 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 731 | |
| e | 552 | |
| r | 468 | 8.7% |
| i | 462 | 8.6% |
| n | 458 | 8.5% |
| o | 442 | 8.2% |
| l | 346 | 6.4% |
| s | 274 | 5.1% |
| t | 223 | 4.2% |
| h | 213 | 4.0% |
| Other values (16) | 1200 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 129 | 10.2% |
| S | 111 | 8.8% |
| M | 100 | 7.9% |
| C | 90 | 7.1% |
| B | 73 | 5.8% |
| P | 70 | 5.5% |
| R | 67 | 5.3% |
| L | 67 | 5.3% |
| G | 59 | 4.7% |
| E | 59 | 4.7% |
| Other values (16) | 443 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 11 | |
| , | 8 | |
| ' | 3 | 13.6% |
Space Separator
| Value | Count | Frequency (%) |
| 32 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 29 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6637 | |
| Common | 83 | 1.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 731 | 11.0% |
| e | 552 | 8.3% |
| r | 468 | 7.1% |
| i | 462 | 7.0% |
| n | 458 | 6.9% |
| o | 442 | 6.7% |
| l | 346 | 5.2% |
| s | 274 | 4.1% |
| t | 223 | 3.4% |
| h | 213 | 3.2% |
| Other values (42) | 2468 |
Common
| Value | Count | Frequency (%) |
| 32 | ||
| - | 29 | |
| . | 11 | 13.3% |
| , | 8 | 9.6% |
| ' | 3 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6720 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 731 | 10.9% |
| e | 552 | 8.2% |
| r | 468 | 7.0% |
| i | 462 | 6.9% |
| n | 458 | 6.8% |
| o | 442 | 6.6% |
| l | 346 | 5.1% |
| s | 274 | 4.1% |
| t | 223 | 3.3% |
| h | 213 | 3.2% |
| Other values (47) | 2551 |
telephone
Text
| Distinct | 994 |
|---|---|
| Distinct (%) | 99.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.6 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 10000 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 988 ? |
|---|---|
| Unique (%) | 98.8% |
Sample
| 1st row | 6463879761 |
|---|---|
| 2nd row | 2019939696 |
| 3rd row | 3479035447 |
| 4th row | 3474690806 |
| 5th row | 7183008023 |
| Value | Count | Frequency (%) |
| 7189229404 | 2 | 0.2% |
| 2126636288 | 2 | 0.2% |
| 6316175951 | 2 | 0.2% |
| 7183891777 | 2 | 0.2% |
| 6465094601 | 2 | 0.2% |
| 7185569700 | 2 | 0.2% |
| 9173742930 | 1 | 0.1% |
| 9293937973 | 1 | 0.1% |
| 9142588462 | 1 | 0.1% |
| 9145863470 | 1 | 0.1% |
| Other values (984) | 984 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1221 | |
| 7 | 1190 | |
| 2 | 1067 | |
| 6 | 1028 | |
| 8 | 960 | |
| 4 | 948 | |
| 0 | 925 | |
| 3 | 909 | |
| 5 | 880 | |
| 9 | 872 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1221 | |
| 7 | 1190 | |
| 2 | 1067 | |
| 6 | 1028 | |
| 8 | 960 | |
| 4 | 948 | |
| 0 | 925 | |
| 3 | 909 | |
| 5 | 880 | |
| 9 | 872 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1221 | |
| 7 | 1190 | |
| 2 | 1067 | |
| 6 | 1028 | |
| 8 | 960 | |
| 4 | 948 | |
| 0 | 925 | |
| 3 | 909 | |
| 5 | 880 | |
| 9 | 872 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1221 | |
| 7 | 1190 | |
| 2 | 1067 | |
| 6 | 1028 | |
| 8 | 960 | |
| 4 | 948 | |
| 0 | 925 | |
| 3 | 909 | |
| 5 | 880 | |
| 9 | 872 |
email
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 1000 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 7.9 KiB |
| Distinct | 998 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 1 |
| Missing (%) | 0.1% |
| Memory size | 421.2 KiB |
Length
| Max length | 4738 |
|---|---|
| Median length | 534 |
| Mean length | 303.6996997 |
| Min length | 5 |
Characters and Unicode
| Total characters | 303396 |
|---|---|
| Distinct characters | 100 |
| Distinct categories | 15 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 4 ? |
Unique
| Unique | 997 ? |
|---|---|
| Unique (%) | 99.8% |
Sample
| 1st row | Zenyai Viet Cajun & Pho Restaurant is dedicated to offering real Vietnamese flavor through distinct seafood boils and pho noodle dishes. |
|---|---|
| 2nd row | Kilduff Underground Engineering, Inc. (KUE) is a geotechnical engineering firm specializing in tunnels, underground design, and construction management throughout North America. Serving as both a design and construction management firm, our strengths lie in our ability to be equally familiar with the design and construction aspects of underground projects. |
| 3rd row | 024™ is a premium home fragrance brand that designs elevated home fragrance products that contain patented scent technology. 024™'s fragrance line eliminates common lingering odors while infusing the space with immersive and captivating scents. |
| 4th row | Our Services include Office Cleaning Carpet cleaning,Floor Stripping and Waxing and General building Maintenance.; ; BUILDING MAINTENANCE; CARPET CLEANING; CLEANING SERVICES; JANITORIAL SERVICES |
| 5th row | NYS Licensed Home Health Agency |
| Value | Count | Frequency (%) |
| and | 2691 | 6.4% |
| the | 905 | 2.1% |
| of | 821 | 1.9% |
| to | 794 | 1.9% |
| a | 701 | 1.7% |
| in | 674 | 1.6% |
| services | 586 | 1.4% |
| we | 543 | 1.3% |
| is | 485 | 1.2% |
| 447 | 1.1% | |
| Other values (6996) | 33506 |
Most occurring characters
| Value | Count | Frequency (%) |
| 41054 | ||
| e | 26721 | 8.8% |
| i | 20885 | 6.9% |
| n | 20755 | 6.8% |
| a | 19373 | 6.4% |
| t | 18407 | 6.1% |
| o | 17312 | 5.7% |
| s | 17177 | 5.7% |
| r | 16930 | 5.6% |
| c | 10580 | 3.5% |
| Other values (90) | 94202 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 235796 | |
| Space Separator | 41079 | 13.5% |
| Uppercase Letter | 15312 | 5.0% |
| Other Punctuation | 8041 | 2.7% |
| Decimal Number | 1155 | 0.4% |
| Control | 770 | 0.3% |
| Dash Punctuation | 719 | 0.2% |
| Open Punctuation | 201 | 0.1% |
| Close Punctuation | 200 | 0.1% |
| Final Punctuation | 70 | < 0.1% |
| Other values (5) | 53 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 26721 | |
| i | 20885 | 8.9% |
| n | 20755 | 8.8% |
| a | 19373 | 8.2% |
| t | 18407 | 7.8% |
| o | 17312 | 7.3% |
| s | 17177 | 7.3% |
| r | 16930 | 7.2% |
| c | 10580 | 4.5% |
| l | 10557 | 4.5% |
| Other values (19) | 57099 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1795 | 11.7% |
| C | 1449 | 9.5% |
| S | 1307 | 8.5% |
| E | 1031 | 6.7% |
| I | 995 | 6.5% |
| T | 933 | 6.1% |
| N | 752 | 4.9% |
| P | 738 | 4.8% |
| W | 723 | 4.7% |
| O | 686 | 4.5% |
| Other values (16) | 4903 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 4449 | |
| . | 2408 | |
| / | 290 | 3.6% |
| & | 277 | 3.4% |
| : | 152 | 1.9% |
| ; | 139 | 1.7% |
| ' | 94 | 1.2% |
| • | 90 | 1.1% |
| ? | 55 | 0.7% |
| " | 24 | 0.3% |
| Other values (7) | 63 | 0.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 332 | |
| 1 | 191 | |
| 2 | 177 | |
| 3 | 89 | 7.7% |
| 5 | 77 | 6.7% |
| 9 | 76 | 6.6% |
| 4 | 73 | 6.3% |
| 7 | 48 | 4.2% |
| 6 | 47 | 4.1% |
| 8 | 45 | 3.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 697 | |
| – | 16 | 2.2% |
| — | 6 | 0.8% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 22 | |
| | | 1 | 4.2% |
| = | 1 | 4.2% |
Space Separator
| Value | Count | Frequency (%) |
| 41054 | ||
| 25 | 0.1% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 60 | |
| ” | 10 | 14.3% |
Other Symbol
| Value | Count | Frequency (%) |
| ™ | 4 | |
| ® | 1 | 20.0% |
Control
| Value | Count | Frequency (%) |
| 770 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 201 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 200 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 13 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 10 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 251108 | |
| Common | 52288 | 17.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 26721 | 10.6% |
| i | 20885 | 8.3% |
| n | 20755 | 8.3% |
| a | 19373 | 7.7% |
| t | 18407 | 7.3% |
| o | 17312 | 6.9% |
| s | 17177 | 6.8% |
| r | 16930 | 6.7% |
| c | 10580 | 4.2% |
| l | 10557 | 4.2% |
| Other values (45) | 72411 |
Common
| Value | Count | Frequency (%) |
| 41054 | ||
| , | 4449 | 8.5% |
| . | 2408 | 4.6% |
| 770 | 1.5% | |
| - | 697 | 1.3% |
| 0 | 332 | 0.6% |
| / | 290 | 0.6% |
| & | 277 | 0.5% |
| ( | 201 | 0.4% |
| ) | 200 | 0.4% |
| Other values (35) | 1610 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 303145 | |
| Punctuation | 195 | 0.1% |
| None | 52 | < 0.1% |
| Letterlike Symbols | 4 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 41054 | ||
| e | 26721 | 8.8% |
| i | 20885 | 6.9% |
| n | 20755 | 6.8% |
| a | 19373 | 6.4% |
| t | 18407 | 6.1% |
| o | 17312 | 5.7% |
| s | 17177 | 5.7% |
| r | 16930 | 5.6% |
| c | 10580 | 3.5% |
| Other values (76) | 93951 |
Punctuation
| Value | Count | Frequency (%) |
| • | 90 | |
| ’ | 60 | |
| – | 16 | 8.2% |
| “ | 13 | 6.7% |
| ” | 10 | 5.1% |
| — | 6 | 3.1% |
None
| Value | Count | Frequency (%) |
| 25 | ||
| · | 16 | |
| ¿ | 4 | 7.7% |
| ç | 3 | 5.8% |
| é | 2 | 3.8% |
| à | 1 | 1.9% |
| ® | 1 | 1.9% |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 4 |
certification
Text
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 59.7 KiB |
Length
| Max length | 11 |
|---|---|
| Median length | 3 |
| Mean length | 4.012 |
| Min length | 3 |
Characters and Unicode
| Total characters | 4012 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | MBE |
|---|---|
| 2nd row | MBE |
| 3rd row | MBE,WBE |
| 4th row | MBE |
| 5th row | MBE,WBE |
| Value | Count | Frequency (%) |
| mbe | 478 | |
| wbe | 270 | |
| mbe,wbe | 248 | |
| mbe,ebe | 2 | 0.2% |
| mbe,wbe,ebe | 1 | 0.1% |
| mbe,lbe | 1 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 1256 | |
| B | 1253 | |
| M | 730 | |
| W | 519 | |
| , | 253 | 6.3% |
| L | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3759 | |
| Other Punctuation | 253 | 6.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1256 | |
| B | 1253 | |
| M | 730 | |
| W | 519 | |
| L | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 253 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3759 | |
| Common | 253 | 6.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 1256 | |
| B | 1253 | |
| M | 730 | |
| W | 519 | |
| L | 1 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| , | 253 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4012 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 1256 | |
| B | 1253 | |
| M | 730 | |
| W | 519 | |
| , | 253 | 6.3% |
| L | 1 | < 0.1% |
MISSING 
| Distinct | 134 |
|---|---|
| Distinct (%) | 18.4% |
| Missing | 270 |
| Missing (%) | 27.0% |
| Memory size | 58.7 KiB |
Length
| Max length | 32 |
|---|---|
| Median length | 9 |
| Mean length | 13.30821918 |
| Min length | 8 |
Characters and Unicode
| Total characters | 9715 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 12 ? |
|---|---|
| Unique (%) | 1.6% |
Sample
| 1st row | 10/31/2025 |
|---|---|
| 2nd row | 7/31/2025 |
| 3rd row | 04/30/2026;04/30/2026 |
| 4th row | 3/31/2026 |
| 5th row | 03/31/2027;03/31/2027 |
| Value | Count | Frequency (%) |
| 6/30/2024 | 22 | 3.0% |
| 6/30/2026 | 17 | 2.3% |
| 10/31/2024 | 15 | 2.1% |
| 4/30/2028 | 12 | 1.6% |
| 2/29/2024 | 12 | 1.6% |
| 5/31/2028 | 12 | 1.6% |
| 8/31/2025 | 12 | 1.6% |
| 11/30/2026 | 11 | 1.5% |
| 3/31/2024 | 11 | 1.5% |
| 7/31/2025 | 11 | 1.5% |
| Other values (124) | 595 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2202 | |
| / | 1966 | |
| 0 | 1807 | |
| 3 | 1048 | |
| 1 | 924 | |
| 8 | 331 | 3.4% |
| 4 | 327 | 3.4% |
| 6 | 271 | 2.8% |
| ; | 253 | 2.6% |
| 7 | 227 | 2.3% |
| Other values (2) | 359 | 3.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7496 | |
| Other Punctuation | 2219 | 22.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2202 | |
| 0 | 1807 | |
| 3 | 1048 | |
| 1 | 924 | |
| 8 | 331 | 4.4% |
| 4 | 327 | 4.4% |
| 6 | 271 | 3.6% |
| 7 | 227 | 3.0% |
| 5 | 225 | 3.0% |
| 9 | 134 | 1.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1966 | |
| ; | 253 | 11.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 9715 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2202 | |
| / | 1966 | |
| 0 | 1807 | |
| 3 | 1048 | |
| 1 | 924 | |
| 8 | 331 | 3.4% |
| 4 | 327 | 3.4% |
| 6 | 271 | 2.8% |
| ; | 253 | 2.6% |
| 7 | 227 | 2.3% |
| Other values (2) | 359 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9715 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2202 | |
| / | 1966 | |
| 0 | 1807 | |
| 3 | 1048 | |
| 1 | 924 | |
| 8 | 331 | 3.4% |
| 4 | 327 | 3.4% |
| 6 | 271 | 2.8% |
| ; | 253 | 2.6% |
| 7 | 227 | 2.3% |
| Other values (2) | 359 | 3.7% |
ethnicity
Text
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 63.1 KiB |
Length
| Max length | 12 |
|---|---|
| Median length | 5 |
| Mean length | 7.442 |
| Min length | 5 |
Characters and Unicode
| Total characters | 7442 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ASIAN |
|---|---|
| 2nd row | HISPANIC |
| 3rd row | BLACK |
| 4th row | BLACK |
| 5th row | BLACK |
| Value | Count | Frequency (%) |
| black | 299 | |
| non-minority | 270 | |
| asian | 247 | |
| hispanic | 184 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 1241 | |
| I | 1155 | |
| A | 977 | |
| O | 540 | 7.3% |
| C | 483 | 6.5% |
| S | 431 | 5.8% |
| B | 299 | 4.0% |
| L | 299 | 4.0% |
| K | 299 | 4.0% |
| - | 270 | 3.6% |
| Other values (6) | 1448 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 7172 | |
| Dash Punctuation | 270 | 3.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1241 | |
| I | 1155 | |
| A | 977 | |
| O | 540 | |
| C | 483 | 6.7% |
| S | 431 | 6.0% |
| B | 299 | 4.2% |
| L | 299 | 4.2% |
| K | 299 | 4.2% |
| M | 270 | 3.8% |
| Other values (5) | 1178 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 270 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7172 | |
| Common | 270 | 3.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 1241 | |
| I | 1155 | |
| A | 977 | |
| O | 540 | |
| C | 483 | 6.7% |
| S | 431 | 6.0% |
| B | 299 | 4.2% |
| L | 299 | 4.2% |
| K | 299 | 4.2% |
| M | 270 | 3.8% |
| Other values (5) | 1178 |
Common
| Value | Count | Frequency (%) |
| - | 270 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7442 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 1241 | |
| I | 1155 | |
| A | 977 | |
| O | 540 | 7.3% |
| C | 483 | 6.5% |
| S | 431 | 5.8% |
| B | 299 | 4.0% |
| L | 299 | 4.0% |
| K | 299 | 4.0% |
| - | 270 | 3.6% |
| Other values (6) | 1448 |
address1
Text
| Distinct | 990 |
|---|---|
| Distinct (%) | 99.3% |
| Missing | 3 |
| Missing (%) | 0.3% |
| Memory size | 73.4 KiB |
Length
| Max length | 44 |
|---|---|
| Median length | 32 |
| Mean length | 18.1444333 |
| Min length | 8 |
Characters and Unicode
| Total characters | 18090 |
|---|---|
| Distinct characters | 68 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 983 ? |
|---|---|
| Unique (%) | 98.6% |
Sample
| 1st row | 208 Grand Street |
|---|---|
| 2nd row | 9 Globe Ct |
| 3rd row | 120 Elgar Place |
| 4th row | 946 Atlantic Ave |
| 5th row | 148 George Street |
| Value | Count | Frequency (%) |
| street | 291 | 8.6% |
| avenue | 233 | 6.9% |
| ave | 96 | 2.8% |
| road | 81 | 2.4% |
| east | 57 | 1.7% |
| st | 57 | 1.7% |
| west | 50 | 1.5% |
| drive | 33 | 1.0% |
| blvd | 32 | 0.9% |
| rd | 28 | 0.8% |
| Other values (1504) | 2431 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2397 | 13.3% | |
| e | 1766 | 9.8% |
| t | 1297 | 7.2% |
| r | 774 | 4.3% |
| 1 | 738 | 4.1% |
| a | 671 | 3.7% |
| n | 657 | 3.6% |
| 2 | 526 | 2.9% |
| o | 517 | 2.9% |
| 0 | 503 | 2.8% |
| Other values (58) | 8244 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9139 | |
| Decimal Number | 3967 | |
| Space Separator | 2397 | 13.3% |
| Uppercase Letter | 2313 | 12.8% |
| Dash Punctuation | 144 | 0.8% |
| Other Punctuation | 130 | 0.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 461 | |
| A | 405 | |
| R | 167 | 7.2% |
| E | 142 | 6.1% |
| W | 134 | 5.8% |
| B | 123 | 5.3% |
| P | 104 | 4.5% |
| C | 91 | 3.9% |
| L | 77 | 3.3% |
| D | 73 | 3.2% |
| Other values (16) | 536 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1766 | |
| t | 1297 | |
| r | 774 | |
| a | 671 | 7.3% |
| n | 657 | 7.2% |
| o | 517 | 5.7% |
| v | 426 | 4.7% |
| d | 401 | 4.4% |
| i | 391 | 4.3% |
| u | 378 | 4.1% |
| Other values (15) | 1861 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 738 | |
| 2 | 526 | |
| 0 | 503 | |
| 3 | 445 | |
| 5 | 387 | |
| 4 | 347 | |
| 6 | 305 | |
| 9 | 247 | 6.2% |
| 8 | 236 | 5.9% |
| 7 | 233 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 59 | |
| , | 47 | |
| # | 21 | 16.2% |
| ' | 2 | 1.5% |
| / | 1 | 0.8% |
Space Separator
| Value | Count | Frequency (%) |
| 2397 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 144 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11452 | |
| Common | 6638 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1766 | |
| t | 1297 | 11.3% |
| r | 774 | 6.8% |
| a | 671 | 5.9% |
| n | 657 | 5.7% |
| o | 517 | 4.5% |
| S | 461 | 4.0% |
| v | 426 | 3.7% |
| A | 405 | 3.5% |
| d | 401 | 3.5% |
| Other values (41) | 4077 |
Common
| Value | Count | Frequency (%) |
| 2397 | ||
| 1 | 738 | 11.1% |
| 2 | 526 | 7.9% |
| 0 | 503 | 7.6% |
| 3 | 445 | 6.7% |
| 5 | 387 | 5.8% |
| 4 | 347 | 5.2% |
| 6 | 305 | 4.6% |
| 9 | 247 | 3.7% |
| 8 | 236 | 3.6% |
| Other values (7) | 507 | 7.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18090 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2397 | 13.3% | |
| e | 1766 | 9.8% |
| t | 1297 | 7.2% |
| r | 774 | 4.3% |
| 1 | 738 | 4.1% |
| a | 671 | 3.7% |
| n | 657 | 3.6% |
| 2 | 526 | 2.9% |
| o | 517 | 2.9% |
| 0 | 503 | 2.8% |
| Other values (58) | 8244 |
address2
Text
MISSING 
| Distinct | 310 |
|---|---|
| Distinct (%) | 81.2% |
| Missing | 618 |
| Missing (%) | 61.8% |
| Memory size | 43.7 KiB |
Length
| Max length | 34 |
|---|---|
| Median length | 32 |
| Mean length | 7.916230366 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3024 |
|---|---|
| Distinct characters | 62 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 268 ? |
|---|---|
| Unique (%) | 70.2% |
Sample
| 1st row | #23D |
|---|---|
| 2nd row | Apt 4C |
| 3rd row | Suite 203 |
| 4th row | Ground Floor |
| 5th row | Suite 5 |
| Value | Count | Frequency (%) |
| suite | 162 | |
| floor | 62 | 8.3% |
| apt | 46 | 6.2% |
| unit | 17 | 2.3% |
| 1 | 17 | 2.3% |
| 14 | 1.9% | |
| 2 | 13 | 1.7% |
| 1st | 12 | 1.6% |
| 3rd | 10 | 1.3% |
| 3 | 9 | 1.2% |
| Other values (263) | 382 |
Most occurring characters
| Value | Count | Frequency (%) |
| 362 | 12.0% | |
| t | 283 | 9.4% |
| i | 192 | 6.3% |
| e | 192 | 6.3% |
| 1 | 176 | 5.8% |
| S | 172 | 5.7% |
| u | 172 | 5.7% |
| o | 150 | 5.0% |
| 0 | 120 | 4.0% |
| 2 | 108 | 3.6% |
| Other values (52) | 1097 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1378 | |
| Decimal Number | 708 | |
| Uppercase Letter | 507 | 16.8% |
| Space Separator | 362 | 12.0% |
| Other Punctuation | 68 | 2.2% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 283 | |
| i | 192 | |
| e | 192 | |
| u | 172 | |
| o | 150 | |
| r | 98 | 7.1% |
| l | 73 | 5.3% |
| p | 44 | 3.2% |
| n | 44 | 3.2% |
| h | 31 | 2.2% |
| Other values (13) | 99 | 7.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 172 | |
| F | 86 | |
| A | 66 | 13.0% |
| B | 21 | 4.1% |
| U | 20 | 3.9% |
| C | 19 | 3.7% |
| D | 17 | 3.4% |
| L | 13 | 2.6% |
| P | 12 | 2.4% |
| N | 11 | 2.2% |
| Other values (12) | 70 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 176 | |
| 0 | 120 | |
| 2 | 108 | |
| 3 | 78 | |
| 4 | 66 | 9.3% |
| 6 | 50 | 7.1% |
| 5 | 48 | 6.8% |
| 7 | 27 | 3.8% |
| 8 | 19 | 2.7% |
| 9 | 16 | 2.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| # | 45 | |
| . | 10 | 14.7% |
| , | 9 | 13.2% |
| / | 3 | 4.4% |
| & | 1 | 1.5% |
Space Separator
| Value | Count | Frequency (%) |
| 362 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1885 | |
| Common | 1139 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 283 | |
| i | 192 | |
| e | 192 | |
| S | 172 | |
| u | 172 | |
| o | 150 | 8.0% |
| r | 98 | 5.2% |
| F | 86 | 4.6% |
| l | 73 | 3.9% |
| A | 66 | 3.5% |
| Other values (35) | 401 |
Common
| Value | Count | Frequency (%) |
| 362 | ||
| 1 | 176 | |
| 0 | 120 | 10.5% |
| 2 | 108 | 9.5% |
| 3 | 78 | 6.8% |
| 4 | 66 | 5.8% |
| 6 | 50 | 4.4% |
| 5 | 48 | 4.2% |
| # | 45 | 4.0% |
| 7 | 27 | 2.4% |
| Other values (7) | 59 | 5.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3024 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 362 | 12.0% | |
| t | 283 | 9.4% |
| i | 192 | 6.3% |
| e | 192 | 6.3% |
| 1 | 176 | 5.8% |
| S | 172 | 5.7% |
| u | 172 | 5.7% |
| o | 150 | 5.0% |
| 0 | 120 | 4.0% |
| 2 | 108 | 3.6% |
| Other values (52) | 1097 |
city
Text
| Distinct | 307 |
|---|---|
| Distinct (%) | 30.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 64.7 KiB |
Length
| Max length | 19 |
|---|---|
| Median length | 18 |
| Mean length | 9.121 |
| Min length | 4 |
Characters and Unicode
| Total characters | 9121 |
|---|---|
| Distinct characters | 52 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 197 ? |
|---|---|
| Unique (%) | 19.7% |
Sample
| 1st row | Brooklyn |
|---|---|
| 2nd row | Red Bank |
| 3rd row | Bronx |
| 4th row | Brooklyn |
| 5th row | Brooklyn |
| Value | Count | Frequency (%) |
| new | 188 | 12.9% |
| brooklyn | 183 | 12.6% |
| york | 173 | 11.9% |
| bronx | 66 | 4.5% |
| island | 51 | 3.5% |
| city | 31 | 2.1% |
| staten | 29 | 2.0% |
| park | 27 | 1.9% |
| jamaica | 22 | 1.5% |
| long | 21 | 1.4% |
| Other values (303) | 665 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 899 | 9.9% |
| r | 687 | 7.5% |
| n | 687 | 7.5% |
| e | 687 | 7.5% |
| l | 540 | 5.9% |
| a | 505 | 5.5% |
| k | 458 | 5.0% |
| 456 | 5.0% | |
| i | 338 | 3.7% |
| t | 322 | 3.5% |
| Other values (42) | 3542 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6909 | |
| Uppercase Letter | 1748 | 19.2% |
| Space Separator | 456 | 5.0% |
| Other Punctuation | 8 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 899 | |
| r | 687 | |
| n | 687 | |
| e | 687 | |
| l | 540 | 7.8% |
| a | 505 | 7.3% |
| k | 458 | 6.6% |
| i | 338 | 4.9% |
| t | 322 | 4.7% |
| s | 314 | 4.5% |
| Other values (15) | 1472 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 294 | |
| N | 226 | |
| Y | 196 | 11.2% |
| S | 107 | 6.1% |
| R | 81 | 4.6% |
| H | 73 | 4.2% |
| I | 68 | 3.9% |
| M | 66 | 3.8% |
| L | 65 | 3.7% |
| C | 65 | 3.7% |
| Other values (15) | 507 |
Space Separator
| Value | Count | Frequency (%) |
| 456 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8657 | |
| Common | 464 | 5.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 899 | 10.4% |
| r | 687 | 7.9% |
| n | 687 | 7.9% |
| e | 687 | 7.9% |
| l | 540 | 6.2% |
| a | 505 | 5.8% |
| k | 458 | 5.3% |
| i | 338 | 3.9% |
| t | 322 | 3.7% |
| s | 314 | 3.6% |
| Other values (40) | 3220 |
Common
| Value | Count | Frequency (%) |
| 456 | ||
| . | 8 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9121 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 899 | 9.9% |
| r | 687 | 7.5% |
| n | 687 | 7.5% |
| e | 687 | 7.5% |
| l | 540 | 5.9% |
| a | 505 | 5.5% |
| k | 458 | 5.0% |
| 456 | 5.0% | |
| i | 338 | 3.7% |
| t | 322 | 3.5% |
| Other values (42) | 3542 |
state
Text
| Distinct | 21 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.7 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 2000 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 10 ? |
|---|---|
| Unique (%) | 1.0% |
Sample
| 1st row | NY |
|---|---|
| 2nd row | NJ |
| 3rd row | NY |
| 4th row | NY |
| 5th row | NY |
| Value | Count | Frequency (%) |
| ny | 872 | |
| nj | 91 | 9.1% |
| pa | 7 | 0.7% |
| ga | 4 | 0.4% |
| ct | 3 | 0.3% |
| il | 3 | 0.3% |
| va | 2 | 0.2% |
| ca | 2 | 0.2% |
| md | 2 | 0.2% |
| ma | 2 | 0.2% |
| Other values (11) | 12 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 966 | |
| Y | 872 | |
| J | 91 | 4.5% |
| A | 17 | 0.9% |
| C | 9 | 0.4% |
| P | 7 | 0.4% |
| M | 6 | 0.3% |
| I | 6 | 0.3% |
| D | 4 | 0.2% |
| L | 4 | 0.2% |
| Other values (9) | 18 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2000 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 966 | |
| Y | 872 | |
| J | 91 | 4.5% |
| A | 17 | 0.9% |
| C | 9 | 0.4% |
| P | 7 | 0.4% |
| M | 6 | 0.3% |
| I | 6 | 0.3% |
| D | 4 | 0.2% |
| L | 4 | 0.2% |
| Other values (9) | 18 | 0.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2000 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 966 | |
| Y | 872 | |
| J | 91 | 4.5% |
| A | 17 | 0.9% |
| C | 9 | 0.4% |
| P | 7 | 0.4% |
| M | 6 | 0.3% |
| I | 6 | 0.3% |
| D | 4 | 0.2% |
| L | 4 | 0.2% |
| Other values (9) | 18 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 966 | |
| Y | 872 | |
| J | 91 | 4.5% |
| A | 17 | 0.9% |
| C | 9 | 0.4% |
| P | 7 | 0.4% |
| M | 6 | 0.3% |
| I | 6 | 0.3% |
| D | 4 | 0.2% |
| L | 4 | 0.2% |
| Other values (9) | 18 | 0.9% |
zip
Text
| Distinct | 410 |
|---|---|
| Distinct (%) | 41.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 60.6 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.902 |
| Min length | 4 |
Characters and Unicode
| Total characters | 4902 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 206 ? |
|---|---|
| Unique (%) | 20.6% |
Sample
| 1st row | 11211 |
|---|---|
| 2nd row | 7701 |
| 3rd row | 10475 |
| 4th row | 11238 |
| 5th row | 11237 |
| Value | Count | Frequency (%) |
| 10001 | 19 | 1.9% |
| 11101 | 17 | 1.7% |
| 11201 | 13 | 1.3% |
| 10018 | 11 | 1.1% |
| 11230 | 11 | 1.1% |
| 11238 | 10 | 1.0% |
| 11432 | 9 | 0.9% |
| 11801 | 9 | 0.9% |
| 11226 | 9 | 0.9% |
| 11413 | 8 | 0.8% |
| Other values (400) | 884 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1788 | |
| 0 | 904 | |
| 2 | 450 | 9.2% |
| 3 | 351 | 7.2% |
| 7 | 320 | 6.5% |
| 4 | 288 | 5.9% |
| 5 | 284 | 5.8% |
| 6 | 224 | 4.6% |
| 8 | 169 | 3.4% |
| 9 | 124 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4902 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1788 | |
| 0 | 904 | |
| 2 | 450 | 9.2% |
| 3 | 351 | 7.2% |
| 7 | 320 | 6.5% |
| 4 | 288 | 5.9% |
| 5 | 284 | 5.8% |
| 6 | 224 | 4.6% |
| 8 | 169 | 3.4% |
| 9 | 124 | 2.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4902 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1788 | |
| 0 | 904 | |
| 2 | 450 | 9.2% |
| 3 | 351 | 7.2% |
| 7 | 320 | 6.5% |
| 4 | 288 | 5.9% |
| 5 | 284 | 5.8% |
| 6 | 224 | 4.6% |
| 8 | 169 | 3.4% |
| 9 | 124 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4902 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1788 | |
| 0 | 904 | |
| 2 | 450 | 9.2% |
| 3 | 351 | 7.2% |
| 7 | 320 | 6.5% |
| 4 | 288 | 5.9% |
| 5 | 284 | 5.8% |
| 6 | 224 | 4.6% |
| 8 | 169 | 3.4% |
| 9 | 124 | 2.5% |
mailingaddress1
Text
| Distinct | 991 |
|---|---|
| Distinct (%) | 99.2% |
| Missing | 1 |
| Missing (%) | 0.1% |
| Memory size | 73.3 KiB |
Length
| Max length | 37 |
|---|---|
| Median length | 31 |
| Mean length | 17.92492492 |
| Min length | 8 |
Characters and Unicode
| Total characters | 17907 |
|---|---|
| Distinct characters | 66 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 983 ? |
|---|---|
| Unique (%) | 98.4% |
Sample
| 1st row | 208 Grand Street |
|---|---|
| 2nd row | 9 Globe Ct |
| 3rd row | 120 Elgar Place |
| 4th row | 946 Atlantic Ave |
| 5th row | 148 George Street |
| Value | Count | Frequency (%) |
| street | 281 | 8.3% |
| avenue | 232 | 6.9% |
| ave | 88 | 2.6% |
| road | 81 | 2.4% |
| east | 61 | 1.8% |
| st | 55 | 1.6% |
| west | 50 | 1.5% |
| drive | 37 | 1.1% |
| box | 32 | 0.9% |
| blvd | 31 | 0.9% |
| Other values (1506) | 2428 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2382 | 13.3% | |
| e | 1718 | 9.6% |
| t | 1243 | 6.9% |
| r | 759 | 4.2% |
| 1 | 740 | 4.1% |
| a | 654 | 3.7% |
| n | 645 | 3.6% |
| o | 526 | 2.9% |
| 2 | 512 | 2.9% |
| 0 | 509 | 2.8% |
| Other values (56) | 8219 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8953 | |
| Decimal Number | 3971 | |
| Space Separator | 2382 | 13.3% |
| Uppercase Letter | 2333 | 13.0% |
| Dash Punctuation | 137 | 0.8% |
| Other Punctuation | 131 | 0.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1718 | |
| t | 1243 | |
| r | 759 | |
| a | 654 | 7.3% |
| n | 645 | 7.2% |
| o | 526 | 5.9% |
| v | 423 | 4.7% |
| d | 389 | 4.3% |
| i | 388 | 4.3% |
| u | 365 | 4.1% |
| Other values (15) | 1843 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 441 | |
| A | 394 | |
| R | 167 | 7.2% |
| B | 142 | 6.1% |
| E | 138 | 5.9% |
| P | 133 | 5.7% |
| W | 128 | 5.5% |
| C | 80 | 3.4% |
| D | 79 | 3.4% |
| L | 77 | 3.3% |
| Other values (15) | 554 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 740 | |
| 2 | 512 | |
| 0 | 509 | |
| 3 | 436 | |
| 5 | 389 | |
| 4 | 350 | |
| 6 | 326 | |
| 9 | 245 | 6.2% |
| 7 | 238 | 6.0% |
| 8 | 226 | 5.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 70 | |
| , | 39 | |
| # | 20 | 15.3% |
| ' | 2 | 1.5% |
Space Separator
| Value | Count | Frequency (%) |
| 2382 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 137 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11286 | |
| Common | 6621 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1718 | |
| t | 1243 | 11.0% |
| r | 759 | 6.7% |
| a | 654 | 5.8% |
| n | 645 | 5.7% |
| o | 526 | 4.7% |
| S | 441 | 3.9% |
| v | 423 | 3.7% |
| A | 394 | 3.5% |
| d | 389 | 3.4% |
| Other values (40) | 4094 |
Common
| Value | Count | Frequency (%) |
| 2382 | ||
| 1 | 740 | 11.2% |
| 2 | 512 | 7.7% |
| 0 | 509 | 7.7% |
| 3 | 436 | 6.6% |
| 5 | 389 | 5.9% |
| 4 | 350 | 5.3% |
| 6 | 326 | 4.9% |
| 9 | 245 | 3.7% |
| 7 | 238 | 3.6% |
| Other values (6) | 494 | 7.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17907 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2382 | 13.3% | |
| e | 1718 | 9.6% |
| t | 1243 | 6.9% |
| r | 759 | 4.2% |
| 1 | 740 | 4.1% |
| a | 654 | 3.7% |
| n | 645 | 3.6% |
| o | 526 | 2.9% |
| 2 | 512 | 2.9% |
| 0 | 509 | 2.8% |
| Other values (56) | 8219 |
mailingaddress2
Text
MISSING 
| Distinct | 330 |
|---|---|
| Distinct (%) | 84.0% |
| Missing | 607 |
| Missing (%) | 60.7% |
| Memory size | 44.0 KiB |
Length
| Max length | 34 |
|---|---|
| Median length | 25 |
| Mean length | 7.786259542 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3060 |
|---|---|
| Distinct characters | 62 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 294 ? |
|---|---|
| Unique (%) | 74.8% |
Sample
| 1st row | #23D |
|---|---|
| 2nd row | Apt 4C |
| 3rd row | #16F |
| 4th row | Ground Floor |
| 5th row | Apt 8G |
| Value | Count | Frequency (%) |
| suite | 160 | 21.1% |
| floor | 57 | 7.5% |
| apt | 54 | 7.1% |
| unit | 16 | 2.1% |
| 1 | 16 | 2.1% |
| 1st | 13 | 1.7% |
| 12 | 1.6% | |
| 2 | 11 | 1.4% |
| 3rd | 9 | 1.2% |
| 2nd | 9 | 1.2% |
| Other values (285) | 402 |
Most occurring characters
| Value | Count | Frequency (%) |
| 366 | 12.0% | |
| t | 284 | 9.3% |
| e | 189 | 6.2% |
| i | 187 | 6.1% |
| 1 | 186 | 6.1% |
| S | 172 | 5.6% |
| u | 169 | 5.5% |
| o | 140 | 4.6% |
| 0 | 124 | 4.1% |
| 2 | 112 | 3.7% |
| Other values (52) | 1131 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1343 | |
| Decimal Number | 759 | |
| Uppercase Letter | 522 | 17.1% |
| Space Separator | 366 | 12.0% |
| Other Punctuation | 69 | 2.3% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 172 | |
| F | 81 | |
| A | 74 | |
| B | 24 | 4.6% |
| U | 20 | 3.8% |
| L | 17 | 3.3% |
| C | 15 | 2.9% |
| D | 14 | 2.7% |
| P | 14 | 2.7% |
| T | 12 | 2.3% |
| Other values (13) | 79 |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 284 | |
| e | 189 | |
| i | 187 | |
| u | 169 | |
| o | 140 | |
| r | 86 | 6.4% |
| l | 68 | 5.1% |
| p | 52 | 3.9% |
| n | 42 | 3.1% |
| h | 29 | 2.2% |
| Other values (12) | 97 | 7.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 186 | |
| 0 | 124 | |
| 2 | 112 | |
| 3 | 85 | |
| 4 | 70 | 9.2% |
| 6 | 51 | 6.7% |
| 5 | 51 | 6.7% |
| 7 | 35 | 4.6% |
| 8 | 25 | 3.3% |
| 9 | 20 | 2.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| # | 46 | |
| . | 11 | 15.9% |
| , | 8 | 11.6% |
| / | 3 | 4.3% |
| & | 1 | 1.4% |
Space Separator
| Value | Count | Frequency (%) |
| 366 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1865 | |
| Common | 1195 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 284 | |
| e | 189 | |
| i | 187 | |
| S | 172 | |
| u | 169 | |
| o | 140 | 7.5% |
| r | 86 | 4.6% |
| F | 81 | 4.3% |
| A | 74 | 4.0% |
| l | 68 | 3.6% |
| Other values (35) | 415 |
Common
| Value | Count | Frequency (%) |
| 366 | ||
| 1 | 186 | |
| 0 | 124 | 10.4% |
| 2 | 112 | 9.4% |
| 3 | 85 | 7.1% |
| 4 | 70 | 5.9% |
| 6 | 51 | 4.3% |
| 5 | 51 | 4.3% |
| # | 46 | 3.8% |
| 7 | 35 | 2.9% |
| Other values (7) | 69 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3060 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 366 | 12.0% | |
| t | 284 | 9.3% |
| e | 189 | 6.2% |
| i | 187 | 6.1% |
| 1 | 186 | 6.1% |
| S | 172 | 5.6% |
| u | 169 | 5.5% |
| o | 140 | 4.6% |
| 0 | 124 | 4.1% |
| 2 | 112 | 3.7% |
| Other values (52) | 1131 |
mailingcity
Text
| Distinct | 311 |
|---|---|
| Distinct (%) | 31.1% |
| Missing | 1 |
| Missing (%) | 0.1% |
| Memory size | 64.7 KiB |
Length
| Max length | 19 |
|---|---|
| Median length | 18 |
| Mean length | 9.138138138 |
| Min length | 4 |
Characters and Unicode
| Total characters | 9129 |
|---|---|
| Distinct characters | 52 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 195 ? |
|---|---|
| Unique (%) | 19.5% |
Sample
| 1st row | Brooklyn |
|---|---|
| 2nd row | Red Bank |
| 3rd row | Bronx |
| 4th row | Brooklyn |
| 5th row | Brooklyn |
| Value | Count | Frequency (%) |
| new | 186 | 12.7% |
| york | 172 | 11.8% |
| brooklyn | 170 | 11.6% |
| bronx | 65 | 4.5% |
| island | 52 | 3.6% |
| city | 31 | 2.1% |
| staten | 30 | 2.1% |
| park | 26 | 1.8% |
| jamaica | 21 | 1.4% |
| long | 20 | 1.4% |
| Other values (300) | 687 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 897 | 9.8% |
| n | 682 | 7.5% |
| r | 680 | 7.4% |
| e | 674 | 7.4% |
| l | 533 | 5.8% |
| a | 517 | 5.7% |
| 461 | 5.0% | |
| k | 455 | 5.0% |
| i | 338 | 3.7% |
| t | 330 | 3.6% |
| Other values (42) | 3562 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6918 | |
| Uppercase Letter | 1741 | 19.1% |
| Space Separator | 461 | 5.0% |
| Other Punctuation | 9 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 897 | |
| n | 682 | |
| r | 680 | |
| e | 674 | |
| l | 533 | 7.7% |
| a | 517 | 7.5% |
| k | 455 | 6.6% |
| i | 338 | 4.9% |
| t | 330 | 4.8% |
| s | 315 | 4.6% |
| Other values (15) | 1497 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 288 | |
| N | 224 | |
| Y | 191 | 11.0% |
| S | 120 | 6.9% |
| R | 77 | 4.4% |
| H | 75 | 4.3% |
| M | 69 | 4.0% |
| I | 67 | 3.8% |
| C | 66 | 3.8% |
| L | 64 | 3.7% |
| Other values (15) | 500 |
Space Separator
| Value | Count | Frequency (%) |
| 461 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 9 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8659 | |
| Common | 470 | 5.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 897 | 10.4% |
| n | 682 | 7.9% |
| r | 680 | 7.9% |
| e | 674 | 7.8% |
| l | 533 | 6.2% |
| a | 517 | 6.0% |
| k | 455 | 5.3% |
| i | 338 | 3.9% |
| t | 330 | 3.8% |
| s | 315 | 3.6% |
| Other values (40) | 3238 |
Common
| Value | Count | Frequency (%) |
| 461 | ||
| . | 9 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9129 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 897 | 9.8% |
| n | 682 | 7.5% |
| r | 680 | 7.4% |
| e | 674 | 7.4% |
| l | 533 | 5.8% |
| a | 517 | 5.7% |
| 461 | 5.0% | |
| k | 455 | 5.0% |
| i | 338 | 3.7% |
| t | 330 | 3.6% |
| Other values (42) | 3562 |
mailingstate
Text
| Distinct | 21 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 63.8 KiB |
Length
| Max length | 20 |
|---|---|
| Median length | 8 |
| Mean length | 8.253 |
| Min length | 4 |
Characters and Unicode
| Total characters | 8253 |
|---|---|
| Distinct characters | 38 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 10 ? |
|---|---|
| Unique (%) | 1.0% |
Sample
| 1st row | New York |
|---|---|
| 2nd row | New Jersey |
| 3rd row | New York |
| 4th row | New York |
| 5th row | New York |
| Value | Count | Frequency (%) |
| new | 963 | |
| york | 872 | |
| jersey | 91 | 4.6% |
| pennsylvania | 7 | 0.4% |
| georgia | 4 | 0.2% |
| connecticut | 3 | 0.2% |
| illinois | 3 | 0.2% |
| maryland | 2 | 0.1% |
| california | 2 | 0.1% |
| massachusetts | 2 | 0.1% |
| Other values (16) | 20 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1165 | |
| r | 978 | |
| 969 | ||
| N | 964 | |
| w | 963 | |
| o | 897 | |
| Y | 872 | |
| k | 872 | |
| s | 114 | 1.4% |
| y | 100 | 1.2% |
| Other values (28) | 359 | 4.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5317 | |
| Uppercase Letter | 1967 | 23.8% |
| Space Separator | 969 | 11.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1165 | |
| r | 978 | |
| w | 963 | |
| o | 897 | |
| k | 872 | |
| s | 114 | 2.1% |
| y | 100 | 1.9% |
| a | 46 | 0.9% |
| n | 43 | 0.8% |
| i | 43 | 0.8% |
| Other values (12) | 96 | 1.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 964 | |
| Y | 872 | |
| J | 91 | 4.6% |
| C | 9 | 0.5% |
| P | 7 | 0.4% |
| M | 6 | 0.3% |
| I | 5 | 0.3% |
| G | 4 | 0.2% |
| D | 2 | 0.1% |
| V | 2 | 0.1% |
| Other values (5) | 5 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 969 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7284 | |
| Common | 969 | 11.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1165 | |
| r | 978 | |
| N | 964 | |
| w | 963 | |
| o | 897 | |
| Y | 872 | |
| k | 872 | |
| s | 114 | 1.6% |
| y | 100 | 1.4% |
| J | 91 | 1.2% |
| Other values (27) | 268 | 3.7% |
Common
| Value | Count | Frequency (%) |
| 969 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8253 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1165 | |
| r | 978 | |
| 969 | ||
| N | 964 | |
| w | 963 | |
| o | 897 | |
| Y | 872 | |
| k | 872 | |
| s | 114 | 1.4% |
| y | 100 | 1.2% |
| Other values (28) | 359 | 4.3% |
mailingzip
Text
| Distinct | 422 |
|---|---|
| Distinct (%) | 42.2% |
| Missing | 1 |
| Missing (%) | 0.1% |
| Memory size | 60.5 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.897897898 |
| Min length | 4 |
Characters and Unicode
| Total characters | 4893 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 220 ? |
|---|---|
| Unique (%) | 22.0% |
Sample
| 1st row | 11211 |
|---|---|
| 2nd row | 7701 |
| 3rd row | 10475 |
| 4th row | 11238 |
| 5th row | 11237 |
| Value | Count | Frequency (%) |
| 10001 | 20 | 2.0% |
| 11101 | 17 | 1.7% |
| 11201 | 14 | 1.4% |
| 11238 | 12 | 1.2% |
| 11230 | 11 | 1.1% |
| 10013 | 10 | 1.0% |
| 11205 | 9 | 0.9% |
| 11801 | 9 | 0.9% |
| 10018 | 9 | 0.9% |
| 11432 | 8 | 0.8% |
| Other values (412) | 880 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1765 | |
| 0 | 918 | |
| 2 | 443 | 9.1% |
| 3 | 366 | 7.5% |
| 7 | 321 | 6.6% |
| 4 | 281 | 5.7% |
| 5 | 267 | 5.5% |
| 6 | 224 | 4.6% |
| 8 | 179 | 3.7% |
| 9 | 129 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4893 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1765 | |
| 0 | 918 | |
| 2 | 443 | 9.1% |
| 3 | 366 | 7.5% |
| 7 | 321 | 6.6% |
| 4 | 281 | 5.7% |
| 5 | 267 | 5.5% |
| 6 | 224 | 4.6% |
| 8 | 179 | 3.7% |
| 9 | 129 | 2.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4893 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1765 | |
| 0 | 918 | |
| 2 | 443 | 9.1% |
| 3 | 366 | 7.5% |
| 7 | 321 | 6.6% |
| 4 | 281 | 5.7% |
| 5 | 267 | 5.5% |
| 6 | 224 | 4.6% |
| 8 | 179 | 3.7% |
| 9 | 129 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4893 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1765 | |
| 0 | 918 | |
| 2 | 443 | 9.1% |
| 3 | 366 | 7.5% |
| 7 | 321 | 6.6% |
| 4 | 281 | 5.7% |
| 5 | 267 | 5.5% |
| 6 | 224 | 4.6% |
| 8 | 179 | 3.7% |
| 9 | 129 | 2.6% |
website
URL
MISSING 
| Distinct | 714 |
|---|---|
| Distinct (%) | 99.6% |
| Missing | 283 |
| Missing (%) | 28.3% |
| Memory size | 68.3 KiB |
| http://www.3gwhse.com | 2 |
|---|---|
| http://www.activeworldnyc.com | 2 |
| http://www.alwaysfirstdemo.com | 2 |
| http://www.afiglass.com | 1 |
| http://www.amarachirestaurant.com | 1 |
| Other values (709) | |
| (Missing) |
| Value | Count | Frequency (%) |
| http://www.3gwhse.com | 2 | 0.2% |
| http://www.activeworldnyc.com | 2 | 0.2% |
| http://www.alwaysfirstdemo.com | 2 | 0.2% |
| http://www.afiglass.com | 1 | 0.1% |
| http://www.amarachirestaurant.com | 1 | 0.1% |
| http://ammep.com | 1 | 0.1% |
| http://www.asrnyc.com | 1 | 0.1% |
| http://www.allpointscom.com | 1 | 0.1% |
| https://arc-geo.com | 1 | 0.1% |
| https://resolutionmanagement.com | 1 | 0.1% |
| Other values (704) | 704 | |
| (Missing) | 283 |
| Value | Count | Frequency (%) |
| http | 608 | |
| https | 109 | 10.9% |
| (Missing) | 283 |
| Value | Count | Frequency (%) |
| www.3gwhse.com | 2 | 0.2% |
| www.activeworldnyc.com | 2 | 0.2% |
| www.alwaysfirstdemo.com | 2 | 0.2% |
| 4futuregenerations.com | 1 | 0.1% |
| www.ArgentoUSA.com | 1 | 0.1% |
| www.alantesecurity.com | 1 | 0.1% |
| 1starnetworks.com | 1 | 0.1% |
| American Awning & Sign Depot, Inc. | 1 | 0.1% |
| www.aof-isi.com | 1 | 0.1% |
| www.blsecuritygroup.com | 1 | 0.1% |
| Other values (704) | 704 | |
| (Missing) | 283 |
| Value | Count | Frequency (%) |
| 675 | ||
| / | 33 | 3.3% |
| /jerseycity | 1 | 0.1% |
| / * https://helloinsight.org/ | 1 | 0.1% |
| /new-york/staten-island/comfort-inn-hotels/ny470 | 1 | 0.1% |
| //www.nyallran.com | 1 | 0.1% |
| /ride-with-us/ | 1 | 0.1% |
| /hotels/travel/nycal-aloft-manhattan-downtown-financial-district/ | 1 | 0.1% |
| /newworldbiz1 | 1 | 0.1% |
| /new-york | 1 | 0.1% |
| (Missing) | 283 |
| Value | Count | Frequency (%) |
| 717 | ||
| (Missing) | 283 | 28.3% |
| Value | Count | Frequency (%) |
| 717 | ||
| (Missing) | 283 | 28.3% |
MISSING 
| Distinct | 847 |
|---|---|
| Distinct (%) | 92.5% |
| Missing | 84 |
| Missing (%) | 8.4% |
| Memory size | 74.3 KiB |
Length
| Max length | 23 |
|---|---|
| Median length | 23 |
| Mean length | 23 |
| Min length | 23 |
Characters and Unicode
| Total characters | 21068 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 787 ? |
|---|---|
| Unique (%) | 85.9% |
Sample
| 1st row | 2019-06-15T00:00:00.000 |
|---|---|
| 2nd row | 2014-12-16T00:00:00.000 |
| 3rd row | 2019-02-08T00:00:00.000 |
| 4th row | 2004-02-02T00:00:00.000 |
| 5th row | 2018-08-23T00:00:00.000 |
| Value | Count | Frequency (%) |
| 2018-01-01t00:00:00.000 | 4 | 0.4% |
| 2005-01-01t00:00:00.000 | 4 | 0.4% |
| 2017-02-13t00:00:00.000 | 3 | 0.3% |
| 2000-01-01t00:00:00.000 | 3 | 0.3% |
| 2016-01-15t00:00:00.000 | 3 | 0.3% |
| 2017-01-09t00:00:00.000 | 3 | 0.3% |
| 2006-01-01t00:00:00.000 | 3 | 0.3% |
| 2006-04-26t00:00:00.000 | 2 | 0.2% |
| 2015-06-12t00:00:00.000 | 2 | 0.2% |
| 2011-01-01t00:00:00.000 | 2 | 0.2% |
| Other values (837) | 887 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 10601 | |
| - | 1832 | 8.7% |
| : | 1832 | 8.7% |
| 1 | 1526 | 7.2% |
| 2 | 1361 | 6.5% |
| T | 916 | 4.3% |
| . | 916 | 4.3% |
| 9 | 506 | 2.4% |
| 3 | 287 | 1.4% |
| 8 | 278 | 1.3% |
| Other values (4) | 1013 | 4.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15572 | |
| Other Punctuation | 2748 | 13.0% |
| Dash Punctuation | 1832 | 8.7% |
| Uppercase Letter | 916 | 4.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 10601 | |
| 1 | 1526 | 9.8% |
| 2 | 1361 | 8.7% |
| 9 | 506 | 3.2% |
| 3 | 287 | 1.8% |
| 8 | 278 | 1.8% |
| 6 | 264 | 1.7% |
| 4 | 261 | 1.7% |
| 5 | 249 | 1.6% |
| 7 | 239 | 1.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1832 | |
| . | 916 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1832 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 916 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 20152 | |
| Latin | 916 | 4.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 10601 | |
| - | 1832 | 9.1% |
| : | 1832 | 9.1% |
| 1 | 1526 | 7.6% |
| 2 | 1361 | 6.8% |
| . | 916 | 4.5% |
| 9 | 506 | 2.5% |
| 3 | 287 | 1.4% |
| 8 | 278 | 1.4% |
| 6 | 264 | 1.3% |
| Other values (3) | 749 | 3.7% |
Latin
| Value | Count | Frequency (%) |
| T | 916 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21068 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 10601 | |
| - | 1832 | 8.7% |
| : | 1832 | 8.7% |
| 1 | 1526 | 7.2% |
| 2 | 1361 | 6.5% |
| T | 916 | 4.3% |
| . | 916 | 4.3% |
| 9 | 506 | 2.4% |
| 3 | 287 | 1.4% |
| 8 | 278 | 1.3% |
| Other values (4) | 1013 | 4.8% |
MISSING 
| Distinct | 39 |
|---|---|
| Distinct (%) | 35.5% |
| Missing | 890 |
| Missing (%) | 89.0% |
| Memory size | 34.8 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 6.863636364 |
| Min length | 3 |
Characters and Unicode
| Total characters | 755 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 20 ? |
|---|---|
| Unique (%) | 18.2% |
Sample
| 1st row | 10000000 |
|---|---|
| 2nd row | 1000000 |
| 3rd row | 7000000 |
| 4th row | 100000 |
| 5th row | 750000 |
| Value | Count | Frequency (%) |
| 50000 | 11 | 10.0% |
| 1000000 | 10 | 9.1% |
| 10000 | 9 | 8.2% |
| 10000000 | 9 | 8.2% |
| 3000000 | 7 | 6.4% |
| 2000000 | 7 | 6.4% |
| 25000000 | 5 | 4.5% |
| 20000000 | 4 | 3.6% |
| 4000000 | 4 | 3.6% |
| 50000000 | 3 | 2.7% |
| Other values (29) | 41 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 609 | |
| 1 | 44 | 5.8% |
| 5 | 41 | 5.4% |
| 2 | 23 | 3.0% |
| 4 | 14 | 1.9% |
| 3 | 10 | 1.3% |
| 7 | 5 | 0.7% |
| 8 | 4 | 0.5% |
| 9 | 3 | 0.4% |
| 6 | 1 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 754 | |
| Dash Punctuation | 1 | 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 609 | |
| 1 | 44 | 5.8% |
| 5 | 41 | 5.4% |
| 2 | 23 | 3.1% |
| 4 | 14 | 1.9% |
| 3 | 10 | 1.3% |
| 7 | 5 | 0.7% |
| 8 | 4 | 0.5% |
| 9 | 3 | 0.4% |
| 6 | 1 | 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 755 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 609 | |
| 1 | 44 | 5.8% |
| 5 | 41 | 5.4% |
| 2 | 23 | 3.0% |
| 4 | 14 | 1.9% |
| 3 | 10 | 1.3% |
| 7 | 5 | 0.7% |
| 8 | 4 | 0.5% |
| 9 | 3 | 0.4% |
| 6 | 1 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 755 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 609 | |
| 1 | 44 | 5.8% |
| 5 | 41 | 5.4% |
| 2 | 23 | 3.0% |
| 4 | 14 | 1.9% |
| 3 | 10 | 1.3% |
| 7 | 5 | 0.7% |
| 8 | 4 | 0.5% |
| 9 | 3 | 0.4% |
| 6 | 1 | 0.1% |
signatory_to_union_contracts
Text
MISSING 
| Distinct | 68 |
|---|---|
| Distinct (%) | 98.6% |
| Missing | 931 |
| Missing (%) | 93.1% |
| Memory size | 36.2 KiB |
Length
| Max length | 100 |
|---|---|
| Median length | 70 |
| Mean length | 46.14492754 |
| Min length | 6 |
Characters and Unicode
| Total characters | 3184 |
|---|---|
| Distinct characters | 64 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 67 ? |
|---|---|
| Unique (%) | 97.1% |
Sample
| 1st row | I.U.O.E Operating Engineers 15, Building, Concrete Excavating & Common Laborer's Union 731 |
|---|---|
| 2nd row | Union Local 78 78, Union Local 12A 12A |
| 3rd row | Mason Tender 79, Laborer Union 20 |
| 4th row | District Council 9 NY 1087, NY District Council of Carpenters 157 |
| 5th row | Pavers & Road Builders 1010, Construction & General Building Laborers 79, General Building Laborers |
| Value | Count | Frequency (%) |
| local | 27 | 5.3% |
| union | 22 | 4.3% |
| laborers | 17 | 3.3% |
| of | 15 | 2.9% |
| 3 | 13 | 2.5% |
| 79 | 13 | 2.5% |
| ibew | 11 | 2.1% |
| metal | 10 | 2.0% |
| mason | 10 | 2.0% |
| 78 | 9 | 1.8% |
| Other values (171) | 365 |
Most occurring characters
| Value | Count | Frequency (%) |
| 454 | 14.3% | |
| e | 221 | 6.9% |
| r | 220 | 6.9% |
| n | 180 | 5.7% |
| o | 178 | 5.6% |
| a | 164 | 5.2% |
| s | 137 | 4.3% |
| t | 136 | 4.3% |
| i | 136 | 4.3% |
| l | 96 | 3.0% |
| Other values (54) | 1262 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1823 | |
| Uppercase Letter | 466 | 14.6% |
| Space Separator | 454 | 14.3% |
| Decimal Number | 336 | 10.6% |
| Other Punctuation | 99 | 3.1% |
| Dash Punctuation | 4 | 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 221 | |
| r | 220 | |
| n | 180 | |
| o | 178 | |
| a | 164 | |
| s | 137 | |
| t | 136 | |
| i | 136 | |
| l | 96 | 5.3% |
| c | 74 | 4.1% |
| Other values (13) | 281 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 53 | |
| C | 44 | 9.4% |
| I | 38 | 8.2% |
| E | 38 | 8.2% |
| A | 34 | 7.3% |
| U | 30 | 6.4% |
| T | 29 | 6.2% |
| B | 29 | 6.2% |
| M | 22 | 4.7% |
| S | 21 | 4.5% |
| Other values (11) | 128 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 58 | |
| 8 | 42 | |
| 2 | 39 | |
| 7 | 39 | |
| 3 | 33 | |
| 0 | 30 | |
| 5 | 25 | |
| 6 | 25 | |
| 4 | 23 | 6.8% |
| 9 | 22 | 6.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 76 | |
| . | 9 | 9.1% |
| ' | 5 | 5.1% |
| & | 5 | 5.1% |
| / | 3 | 3.0% |
| # | 1 | 1.0% |
Space Separator
| Value | Count | Frequency (%) |
| 454 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2289 | |
| Common | 895 | 28.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 221 | 9.7% |
| r | 220 | 9.6% |
| n | 180 | 7.9% |
| o | 178 | 7.8% |
| a | 164 | 7.2% |
| s | 137 | 6.0% |
| t | 136 | 5.9% |
| i | 136 | 5.9% |
| l | 96 | 4.2% |
| c | 74 | 3.2% |
| Other values (34) | 747 |
Common
| Value | Count | Frequency (%) |
| 454 | ||
| , | 76 | 8.5% |
| 1 | 58 | 6.5% |
| 8 | 42 | 4.7% |
| 2 | 39 | 4.4% |
| 7 | 39 | 4.4% |
| 3 | 33 | 3.7% |
| 0 | 30 | 3.4% |
| 5 | 25 | 2.8% |
| 6 | 25 | 2.8% |
| Other values (10) | 74 | 8.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3184 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 454 | 14.3% | |
| e | 221 | 6.9% |
| r | 220 | 6.9% |
| n | 180 | 5.7% |
| o | 178 | 5.6% |
| a | 164 | 5.2% |
| s | 137 | 4.3% |
| t | 136 | 4.3% |
| i | 136 | 4.3% |
| l | 96 | 3.0% |
| Other values (54) | 1262 |
| Distinct | 247 |
|---|---|
| Distinct (%) | 24.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 61.6 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Characters and Unicode
| Total characters | 6000 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 113 ? |
|---|---|
| Unique (%) | 11.3% |
Sample
| 1st row | 722511 |
|---|---|
| 2nd row | 541330 |
| 3rd row | 453998 |
| 4th row | 561720 |
| 5th row | 621610 |
| Value | Count | Frequency (%) |
| 541330 | 40 | 4.0% |
| 541310 | 39 | 3.9% |
| 236220 | 31 | 3.1% |
| 541990 | 30 | 3.0% |
| 238990 | 29 | 2.9% |
| 236118 | 28 | 2.8% |
| 238210 | 27 | 2.7% |
| 541611 | 27 | 2.7% |
| 561720 | 24 | 2.4% |
| 611710 | 21 | 2.1% |
| Other values (237) | 704 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1358 | |
| 2 | 808 | |
| 0 | 750 | |
| 3 | 715 | |
| 5 | 588 | |
| 4 | 570 | |
| 9 | 410 | 6.8% |
| 6 | 368 | 6.1% |
| 8 | 305 | 5.1% |
| 7 | 128 | 2.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1358 | |
| 2 | 808 | |
| 0 | 750 | |
| 3 | 715 | |
| 5 | 588 | |
| 4 | 570 | |
| 9 | 410 | 6.8% |
| 6 | 368 | 6.1% |
| 8 | 305 | 5.1% |
| 7 | 128 | 2.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1358 | |
| 2 | 808 | |
| 0 | 750 | |
| 3 | 715 | |
| 5 | 588 | |
| 4 | 570 | |
| 9 | 410 | 6.8% |
| 6 | 368 | 6.1% |
| 8 | 305 | 5.1% |
| 7 | 128 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1358 | |
| 2 | 808 | |
| 0 | 750 | |
| 3 | 715 | |
| 5 | 588 | |
| 4 | 570 | |
| 9 | 410 | 6.8% |
| 6 | 368 | 6.1% |
| 8 | 305 | 5.1% |
| 7 | 128 | 2.1% |
naics_sector
Text
| Distinct | 19 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 87.1 KiB |
Length
| Max length | 72 |
|---|---|
| Median length | 48 |
| Mean length | 32.012 |
| Min length | 9 |
Characters and Unicode
| Total characters | 32012 |
|---|---|
| Distinct characters | 40 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Accommodation and Food Services |
|---|---|
| 2nd row | Professional, Scientific, and Technical Services |
| 3rd row | Retail Trade |
| 4th row | Administrative and Support and Waste Management and Remediation Services |
| 5th row | Health Care and Social Assistance |
| Value | Count | Frequency (%) |
| and | 722 | |
| services | 473 | |
| professional | 295 | 8.3% |
| scientific | 295 | 8.3% |
| technical | 295 | 8.3% |
| construction | 257 | 7.2% |
| management | 89 | 2.5% |
| administrative | 88 | 2.5% |
| support | 88 | 2.5% |
| waste | 88 | 2.5% |
| Other values (36) | 871 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 3152 | 9.8% |
| n | 3054 | 9.5% |
| e | 2944 | 9.2% |
| 2561 | 8.0% | |
| a | 2516 | 7.9% |
| c | 2220 | 6.9% |
| t | 1919 | 6.0% |
| s | 1801 | 5.6% |
| o | 1709 | 5.3% |
| r | 1611 | 5.0% |
| Other values (30) | 8525 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 25948 | |
| Uppercase Letter | 2815 | 8.8% |
| Space Separator | 2561 | 8.0% |
| Other Punctuation | 642 | 2.0% |
| Open Punctuation | 23 | 0.1% |
| Close Punctuation | 23 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 3152 | |
| n | 3054 | |
| e | 2944 | |
| a | 2516 | |
| c | 2220 | |
| t | 1919 | |
| s | 1801 | |
| o | 1709 | |
| r | 1611 | |
| d | 1092 | 4.2% |
| Other values (11) | 3930 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 889 | |
| T | 397 | |
| P | 324 | 11.5% |
| C | 291 | 10.3% |
| A | 201 | 7.1% |
| R | 182 | 6.5% |
| W | 154 | 5.5% |
| M | 133 | 4.7% |
| E | 84 | 3.0% |
| F | 44 | 1.6% |
| Other values (5) | 116 | 4.1% |
Space Separator
| Value | Count | Frequency (%) |
| 2561 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 642 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 23 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 23 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 28763 | |
| Common | 3249 | 10.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 3152 | |
| n | 3054 | |
| e | 2944 | |
| a | 2516 | 8.7% |
| c | 2220 | 7.7% |
| t | 1919 | 6.7% |
| s | 1801 | 6.3% |
| o | 1709 | 5.9% |
| r | 1611 | 5.6% |
| d | 1092 | 3.8% |
| Other values (26) | 6745 |
Common
| Value | Count | Frequency (%) |
| 2561 | ||
| , | 642 | 19.8% |
| ( | 23 | 0.7% |
| ) | 23 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 32012 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 3152 | 9.8% |
| n | 3054 | 9.5% |
| e | 2944 | 9.2% |
| 2561 | 8.0% | |
| a | 2516 | 7.9% |
| c | 2220 | 6.9% |
| t | 1919 | 6.0% |
| s | 1801 | 5.6% |
| o | 1709 | 5.3% |
| r | 1611 | 5.0% |
| Other values (30) | 8525 |
naics_subsector
Text
| Distinct | 145 |
|---|---|
| Distinct (%) | 14.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 95.9 KiB |
Length
| Max length | 102 |
|---|---|
| Median length | 71 |
| Mean length | 41.027 |
| Min length | 12 |
Characters and Unicode
| Total characters | 41027 |
|---|---|
| Distinct characters | 52 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 55 ? |
|---|---|
| Unique (%) | 5.5% |
Sample
| 1st row | Restaurants and Other Eating Places |
|---|---|
| 2nd row | Architectural, Engineering, and Related Services |
| 3rd row | Other Miscellaneous Store Retailers |
| 4th row | Services to Buildings and Dwellings |
| 5th row | Home Health Care Services |
| Value | Count | Frequency (%) |
| and | 611 | 13.0% |
| services | 453 | 9.7% |
| building | 215 | 4.6% |
| other | 189 | 4.0% |
| contractors | 175 | 3.7% |
| related | 160 | 3.4% |
| technical | 115 | 2.5% |
| scientific | 114 | 2.4% |
| architectural | 90 | 1.9% |
| engineering | 89 | 1.9% |
| Other values (308) | 2477 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 4016 | 9.8% |
| 3688 | 9.0% | |
| i | 3570 | 8.7% |
| n | 3338 | 8.1% |
| t | 2729 | 6.7% |
| r | 2639 | 6.4% |
| a | 2496 | 6.1% |
| s | 2010 | 4.9% |
| c | 1993 | 4.9% |
| o | 1718 | 4.2% |
| Other values (42) | 12830 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 32584 | |
| Uppercase Letter | 3998 | 9.7% |
| Space Separator | 3688 | 9.0% |
| Other Punctuation | 755 | 1.8% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4016 | |
| i | 3570 | |
| n | 3338 | |
| t | 2729 | |
| r | 2639 | |
| a | 2496 | 7.7% |
| s | 2010 | 6.2% |
| c | 1993 | 6.1% |
| o | 1718 | 5.3% |
| l | 1533 | 4.7% |
| Other values (15) | 6542 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 903 | |
| C | 430 | |
| E | 306 | 7.7% |
| B | 303 | 7.6% |
| R | 297 | 7.4% |
| A | 245 | 6.1% |
| P | 236 | 5.9% |
| O | 218 | 5.5% |
| T | 217 | 5.4% |
| M | 215 | 5.4% |
| Other values (12) | 628 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 753 | |
| ; | 2 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 3688 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 36582 | |
| Common | 4445 | 10.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 4016 | 11.0% |
| i | 3570 | 9.8% |
| n | 3338 | 9.1% |
| t | 2729 | 7.5% |
| r | 2639 | 7.2% |
| a | 2496 | 6.8% |
| s | 2010 | 5.5% |
| c | 1993 | 5.4% |
| o | 1718 | 4.7% |
| l | 1533 | 4.2% |
| Other values (37) | 10540 |
Common
| Value | Count | Frequency (%) |
| 3688 | ||
| , | 753 | 16.9% |
| ; | 2 | < 0.1% |
| ( | 1 | < 0.1% |
| ) | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 41027 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 4016 | 9.8% |
| 3688 | 9.0% | |
| i | 3570 | 8.7% |
| n | 3338 | 8.1% |
| t | 2729 | 6.7% |
| r | 2639 | 6.4% |
| a | 2496 | 6.1% |
| s | 2010 | 4.9% |
| c | 1993 | 4.9% |
| o | 1718 | 4.2% |
| Other values (42) | 12830 |
naics_title
Text
| Distinct | 247 |
|---|---|
| Distinct (%) | 24.8% |
| Missing | 3 |
| Missing (%) | 0.3% |
| Memory size | 92.6 KiB |
Length
| Max length | 107 |
|---|---|
| Median length | 68 |
| Mean length | 37.84653962 |
| Min length | 8 |
Characters and Unicode
| Total characters | 37733 |
|---|---|
| Distinct characters | 53 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 113 ? |
|---|---|
| Unique (%) | 11.3% |
Sample
| 1st row | Full-Service Restaurants |
|---|---|
| 2nd row | Engineering Services |
| 3rd row | All Other Miscellaneous Store Retailers (except Tobacco Stores) |
| 4th row | Janitorial Services |
| 5th row | Home Health Care Services |
| Value | Count | Frequency (%) |
| and | 441 | 10.0% |
| services | 406 | 9.2% |
| other | 243 | 5.5% |
| contractors | 204 | 4.6% |
| all | 97 | 2.2% |
| management | 71 | 1.6% |
| consulting | 66 | 1.5% |
| building | 60 | 1.4% |
| construction | 55 | 1.3% |
| technical | 47 | 1.1% |
| Other values (465) | 2707 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 3629 | 9.6% |
| 3400 | 9.0% | |
| i | 2876 | 7.6% |
| r | 2821 | 7.5% |
| n | 2815 | 7.5% |
| t | 2656 | 7.0% |
| a | 2427 | 6.4% |
| s | 2000 | 5.3% |
| o | 1858 | 4.9% |
| c | 1718 | 4.6% |
| Other values (43) | 11533 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 30020 | |
| Uppercase Letter | 3900 | 10.3% |
| Space Separator | 3400 | 9.0% |
| Other Punctuation | 274 | 0.7% |
| Dash Punctuation | 47 | 0.1% |
| Open Punctuation | 46 | 0.1% |
| Close Punctuation | 46 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3629 | |
| i | 2876 | |
| r | 2821 | |
| n | 2815 | |
| t | 2656 | |
| a | 2427 | |
| s | 2000 | 6.7% |
| o | 1858 | 6.2% |
| c | 1718 | 5.7% |
| l | 1572 | 5.2% |
| Other values (16) | 5648 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 759 | |
| C | 556 | |
| A | 332 | |
| O | 295 | 7.6% |
| P | 285 | 7.3% |
| M | 242 | 6.2% |
| E | 196 | 5.0% |
| R | 194 | 5.0% |
| T | 151 | 3.9% |
| I | 130 | 3.3% |
| Other values (11) | 760 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 271 | |
| ' | 3 | 1.1% |
Space Separator
| Value | Count | Frequency (%) |
| 3400 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 47 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 46 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 46 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 33920 | |
| Common | 3813 | 10.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3629 | 10.7% |
| i | 2876 | 8.5% |
| r | 2821 | 8.3% |
| n | 2815 | 8.3% |
| t | 2656 | 7.8% |
| a | 2427 | 7.2% |
| s | 2000 | 5.9% |
| o | 1858 | 5.5% |
| c | 1718 | 5.1% |
| l | 1572 | 4.6% |
| Other values (37) | 9548 |
Common
| Value | Count | Frequency (%) |
| 3400 | ||
| , | 271 | 7.1% |
| - | 47 | 1.2% |
| ( | 46 | 1.2% |
| ) | 46 | 1.2% |
| ' | 3 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 37733 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 3629 | 9.6% |
| 3400 | 9.0% | |
| i | 2876 | 7.6% |
| r | 2821 | 7.5% |
| n | 2815 | 7.5% |
| t | 2656 | 7.0% |
| a | 2427 | 6.4% |
| s | 2000 | 5.3% |
| o | 1858 | 4.9% |
| c | 1718 | 4.6% |
| Other values (43) | 11533 |
types_of_construction_projects_performed
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 1000 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 7.9 KiB |
nigp_codes
Text
| Distinct | 883 |
|---|---|
| Distinct (%) | 88.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 94.1 KiB |
Length
| Max length | 797 |
|---|---|
| Median length | 653 |
| Mean length | 39.19 |
| Min length | 3 |
Characters and Unicode
| Total characters | 39190 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 828 ? |
|---|---|
| Unique (%) | 82.8% |
Sample
| 1st row | 37003 |
|---|---|
| 2nd row | 91800 | 92500 |
| 3rd row | 39356 |
| 4th row | 48500 |
| 5th row | 94845 |
| Value | Count | Frequency (%) |
| 4275 | ||
| 91800 | 74 | 0.8% |
| 91200 | 46 | 0.5% |
| 91455 | 26 | 0.3% |
| 92400 | 24 | 0.3% |
| 91872 | 23 | 0.2% |
| 91548 | 23 | 0.2% |
| 90607 | 22 | 0.2% |
| 91461 | 22 | 0.2% |
| 91841 | 21 | 0.2% |
| Other values (2520) | 4994 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8550 | ||
| 0 | 4750 | |
| | | 4275 | |
| 9 | 3738 | |
| 5 | 2988 | 7.6% |
| 1 | 2689 | 6.9% |
| 2 | 2546 | 6.5% |
| 4 | 2303 | 5.9% |
| 8 | 2257 | 5.8% |
| 6 | 1995 | 5.1% |
| Other values (2) | 3099 | 7.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 26365 | |
| Space Separator | 8550 | 21.8% |
| Math Symbol | 4275 | 10.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4750 | |
| 9 | 3738 | |
| 5 | 2988 | |
| 1 | 2689 | |
| 2 | 2546 | |
| 4 | 2303 | |
| 8 | 2257 | |
| 6 | 1995 | |
| 3 | 1641 | 6.2% |
| 7 | 1458 | 5.5% |
Space Separator
| Value | Count | Frequency (%) |
| 8550 |
Math Symbol
| Value | Count | Frequency (%) |
| | | 4275 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 39190 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 8550 | ||
| 0 | 4750 | |
| | | 4275 | |
| 9 | 3738 | |
| 5 | 2988 | 7.6% |
| 1 | 2689 | 6.9% |
| 2 | 2546 | 6.5% |
| 4 | 2303 | 5.9% |
| 8 | 2257 | 5.8% |
| 6 | 1995 | 5.1% |
| Other values (2) | 3099 | 7.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 39190 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8550 | ||
| 0 | 4750 | |
| | | 4275 | |
| 9 | 3738 | |
| 5 | 2988 | 7.6% |
| 1 | 2689 | 6.9% |
| 2 | 2546 | 6.5% |
| 4 | 2303 | 5.9% |
| 8 | 2257 | 5.8% |
| 6 | 1995 | 5.1% |
| Other values (2) | 3099 | 7.9% |
MISSING 
| Distinct | 907 |
|---|---|
| Distinct (%) | 94.7% |
| Missing | 42 |
| Missing (%) | 4.2% |
| Memory size | 73.3 KiB |
Length
| Max length | 91 |
|---|---|
| Median length | 50 |
| Mean length | 19.68997912 |
| Min length | 3 |
Characters and Unicode
| Total characters | 18863 |
|---|---|
| Distinct characters | 74 |
| Distinct categories | 10 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 875 ? |
|---|---|
| Unique (%) | 91.3% |
Sample
| 1st row | Rethink |
|---|---|
| 2nd row | Walsh Construction |
| 3rd row | Tamara Wilson |
| 4th row | La Land Baptiste LLC |
| 5th row | Jan Jaskot |
| Value | Count | Frequency (%) |
| of | 91 | 3.2% |
| nyc | 68 | 2.4% |
| inc | 66 | 2.3% |
| construction | 64 | 2.3% |
| 55 | 1.9% | |
| llc | 53 | 1.9% |
| new | 42 | 1.5% |
| department | 40 | 1.4% |
| services | 36 | 1.3% |
| york | 36 | 1.3% |
| Other values (1402) | 2289 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1885 | 10.0% | |
| e | 1402 | 7.4% |
| o | 1200 | 6.4% |
| n | 1185 | 6.3% |
| t | 1116 | 5.9% |
| r | 1056 | 5.6% |
| a | 1009 | 5.3% |
| i | 997 | 5.3% |
| s | 702 | 3.7% |
| C | 613 | 3.2% |
| Other values (64) | 7698 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12404 | |
| Uppercase Letter | 4218 | 22.4% |
| Space Separator | 1885 | 10.0% |
| Other Punctuation | 226 | 1.2% |
| Decimal Number | 82 | 0.4% |
| Dash Punctuation | 23 | 0.1% |
| Open Punctuation | 10 | 0.1% |
| Close Punctuation | 10 | 0.1% |
| Math Symbol | 3 | < 0.1% |
| Final Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1402 | |
| o | 1200 | |
| n | 1185 | |
| t | 1116 | |
| r | 1056 | 8.5% |
| a | 1009 | 8.1% |
| i | 997 | 8.0% |
| s | 702 | 5.7% |
| l | 581 | 4.7% |
| c | 510 | 4.1% |
| Other values (16) | 2646 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 613 | |
| S | 332 | 7.9% |
| A | 321 | 7.6% |
| N | 273 | 6.5% |
| L | 235 | 5.6% |
| E | 207 | 4.9% |
| D | 200 | 4.7% |
| H | 199 | 4.7% |
| I | 199 | 4.7% |
| M | 193 | 4.6% |
| Other values (16) | 1446 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 27 | |
| 3 | 10 | 12.2% |
| 0 | 10 | 12.2% |
| 2 | 8 | 9.8% |
| 5 | 7 | 8.5% |
| 4 | 6 | 7.3% |
| 9 | 4 | 4.9% |
| 6 | 4 | 4.9% |
| 8 | 3 | 3.7% |
| 7 | 3 | 3.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 93 | |
| , | 52 | |
| & | 45 | |
| / | 27 | 11.9% |
| ' | 9 | 4.0% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 2 | |
| | | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1885 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 23 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 10 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 10 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16622 | |
| Common | 2241 | 11.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1402 | 8.4% |
| o | 1200 | 7.2% |
| n | 1185 | 7.1% |
| t | 1116 | 6.7% |
| r | 1056 | 6.4% |
| a | 1009 | 6.1% |
| i | 997 | 6.0% |
| s | 702 | 4.2% |
| C | 613 | 3.7% |
| l | 581 | 3.5% |
| Other values (42) | 6761 |
Common
| Value | Count | Frequency (%) |
| 1885 | ||
| . | 93 | 4.1% |
| , | 52 | 2.3% |
| & | 45 | 2.0% |
| / | 27 | 1.2% |
| 1 | 27 | 1.2% |
| - | 23 | 1.0% |
| 3 | 10 | 0.4% |
| 0 | 10 | 0.4% |
| ( | 10 | 0.4% |
| Other values (12) | 59 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18861 | |
| Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1885 | 10.0% | |
| e | 1402 | 7.4% |
| o | 1200 | 6.4% |
| n | 1185 | 6.3% |
| t | 1116 | 5.9% |
| r | 1056 | 5.6% |
| a | 1009 | 5.3% |
| i | 997 | 5.3% |
| s | 702 | 3.7% |
| C | 613 | 3.3% |
| Other values (63) | 7696 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 2 |
MISSING 
| Distinct | 654 |
|---|---|
| Distinct (%) | 69.0% |
| Missing | 52 |
| Missing (%) | 5.2% |
| Memory size | 59.7 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 5.581223629 |
| Min length | 1 |
Characters and Unicode
| Total characters | 5291 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 545 ? |
|---|---|
| Unique (%) | 57.5% |
Sample
| 1st row | 50000 |
|---|---|
| 2nd row | 273350 |
| 3rd row | 20 |
| 4th row | 200000 |
| 5th row | 150 |
| Value | Count | Frequency (%) |
| 5000 | 14 | 1.5% |
| 500000 | 12 | 1.3% |
| 20000 | 10 | 1.1% |
| 50000 | 10 | 1.1% |
| 250000 | 10 | 1.1% |
| 200000 | 10 | 1.1% |
| 300000 | 9 | 0.9% |
| 100000 | 9 | 0.9% |
| 150000 | 9 | 0.9% |
| 400000 | 8 | 0.8% |
| Other values (643) | 847 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2310 | |
| 1 | 474 | 9.0% |
| 5 | 461 | 8.7% |
| 2 | 391 | 7.4% |
| 7 | 298 | 5.6% |
| 4 | 293 | 5.5% |
| 3 | 282 | 5.3% |
| 8 | 269 | 5.1% |
| 6 | 249 | 4.7% |
| 9 | 218 | 4.1% |
| Other values (2) | 46 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5245 | |
| Other Punctuation | 45 | 0.9% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2310 | |
| 1 | 474 | 9.0% |
| 5 | 461 | 8.8% |
| 2 | 391 | 7.5% |
| 7 | 298 | 5.7% |
| 4 | 293 | 5.6% |
| 3 | 282 | 5.4% |
| 8 | 269 | 5.1% |
| 6 | 249 | 4.7% |
| 9 | 218 | 4.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 45 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5291 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2310 | |
| 1 | 474 | 9.0% |
| 5 | 461 | 8.7% |
| 2 | 391 | 7.4% |
| 7 | 298 | 5.6% |
| 4 | 293 | 5.5% |
| 3 | 282 | 5.3% |
| 8 | 269 | 5.1% |
| 6 | 249 | 4.7% |
| 9 | 218 | 4.1% |
| Other values (2) | 46 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5291 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2310 | |
| 1 | 474 | 9.0% |
| 5 | 461 | 8.7% |
| 2 | 391 | 7.4% |
| 7 | 298 | 5.6% |
| 4 | 293 | 5.5% |
| 3 | 282 | 5.3% |
| 8 | 269 | 5.1% |
| 6 | 249 | 4.7% |
| 9 | 218 | 4.1% |
| Other values (2) | 46 | 0.9% |
percent_self_performed_job_exp_1
Text
MISSING 
| Distinct | 30 |
|---|---|
| Distinct (%) | 3.3% |
| Missing | 97 |
| Missing (%) | 9.7% |
| Memory size | 55.9 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.782945736 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2513 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 10 ? |
|---|---|
| Unique (%) | 1.1% |
Sample
| 1st row | 100 |
|---|---|
| 2nd row | 100 |
| 3rd row | 100 |
| 4th row | 100 |
| 5th row | 100 |
| Value | Count | Frequency (%) |
| 100 | 727 | |
| 50 | 26 | 2.9% |
| 90 | 17 | 1.9% |
| 0 | 17 | 1.9% |
| 80 | 14 | 1.6% |
| 75 | 13 | 1.4% |
| 95 | 12 | 1.3% |
| 70 | 10 | 1.1% |
| 20 | 9 | 1.0% |
| 60 | 7 | 0.8% |
| Other values (20) | 51 | 5.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1567 | |
| 1 | 735 | |
| 5 | 79 | 3.1% |
| 9 | 34 | 1.4% |
| 8 | 25 | 1.0% |
| 7 | 25 | 1.0% |
| 2 | 16 | 0.6% |
| 6 | 14 | 0.6% |
| 3 | 10 | 0.4% |
| 4 | 8 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2513 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1567 | |
| 1 | 735 | |
| 5 | 79 | 3.1% |
| 9 | 34 | 1.4% |
| 8 | 25 | 1.0% |
| 7 | 25 | 1.0% |
| 2 | 16 | 0.6% |
| 6 | 14 | 0.6% |
| 3 | 10 | 0.4% |
| 4 | 8 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2513 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1567 | |
| 1 | 735 | |
| 5 | 79 | 3.1% |
| 9 | 34 | 1.4% |
| 8 | 25 | 1.0% |
| 7 | 25 | 1.0% |
| 2 | 16 | 0.6% |
| 6 | 14 | 0.6% |
| 3 | 10 | 0.4% |
| 4 | 8 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2513 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1567 | |
| 1 | 735 | |
| 5 | 79 | 3.1% |
| 9 | 34 | 1.4% |
| 8 | 25 | 1.0% |
| 7 | 25 | 1.0% |
| 2 | 16 | 0.6% |
| 6 | 14 | 0.6% |
| 3 | 10 | 0.4% |
| 4 | 8 | 0.3% |
MISSING 
| Distinct | 706 |
|---|---|
| Distinct (%) | 73.7% |
| Missing | 42 |
| Missing (%) | 4.2% |
| Memory size | 76.3 KiB |
Length
| Max length | 23 |
|---|---|
| Median length | 23 |
| Mean length | 23 |
| Min length | 23 |
Characters and Unicode
| Total characters | 22034 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 561 ? |
|---|---|
| Unique (%) | 58.6% |
Sample
| 1st row | 2020-04-01T00:00:00.000 |
|---|---|
| 2nd row | 2020-12-08T00:00:00.000 |
| 3rd row | 2020-10-06T00:00:00.000 |
| 4th row | 2018-01-01T00:00:00.000 |
| 5th row | 2021-05-28T00:00:00.000 |
| Value | Count | Frequency (%) |
| 2018-01-01t00:00:00.000 | 16 | 1.7% |
| 2019-01-01t00:00:00.000 | 9 | 0.9% |
| 2020-01-01t00:00:00.000 | 9 | 0.9% |
| 2015-01-01t00:00:00.000 | 7 | 0.7% |
| 2021-07-01t00:00:00.000 | 7 | 0.7% |
| 2022-01-01t00:00:00.000 | 6 | 0.6% |
| 2017-03-01t00:00:00.000 | 5 | 0.5% |
| 2019-04-01t00:00:00.000 | 5 | 0.5% |
| 2020-01-15t00:00:00.000 | 5 | 0.5% |
| 2019-07-01t00:00:00.000 | 5 | 0.5% |
| Other values (696) | 884 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 11043 | |
| 2 | 1921 | 8.7% |
| - | 1916 | 8.7% |
| : | 1916 | 8.7% |
| 1 | 1715 | 7.8% |
| T | 958 | 4.3% |
| . | 958 | 4.3% |
| 3 | 312 | 1.4% |
| 9 | 294 | 1.3% |
| 8 | 261 | 1.2% |
| Other values (4) | 740 | 3.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16286 | |
| Other Punctuation | 2874 | 13.0% |
| Dash Punctuation | 1916 | 8.7% |
| Uppercase Letter | 958 | 4.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 11043 | |
| 2 | 1921 | 11.8% |
| 1 | 1715 | 10.5% |
| 3 | 312 | 1.9% |
| 9 | 294 | 1.8% |
| 8 | 261 | 1.6% |
| 7 | 217 | 1.3% |
| 6 | 186 | 1.1% |
| 4 | 183 | 1.1% |
| 5 | 154 | 0.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1916 | |
| . | 958 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1916 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 958 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 21076 | |
| Latin | 958 | 4.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 11043 | |
| 2 | 1921 | 9.1% |
| - | 1916 | 9.1% |
| : | 1916 | 9.1% |
| 1 | 1715 | 8.1% |
| . | 958 | 4.5% |
| 3 | 312 | 1.5% |
| 9 | 294 | 1.4% |
| 8 | 261 | 1.2% |
| 7 | 217 | 1.0% |
| Other values (3) | 523 | 2.5% |
Latin
| Value | Count | Frequency (%) |
| T | 958 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 22034 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 11043 | |
| 2 | 1921 | 8.7% |
| - | 1916 | 8.7% |
| : | 1916 | 8.7% |
| 1 | 1715 | 7.8% |
| T | 958 | 4.3% |
| . | 958 | 4.3% |
| 3 | 312 | 1.4% |
| 9 | 294 | 1.3% |
| 8 | 261 | 1.2% |
| Other values (4) | 740 | 3.4% |
description_of_work_job_exp_1
Text
MISSING 
| Distinct | 948 |
|---|---|
| Distinct (%) | 99.1% |
| Missing | 43 |
| Missing (%) | 4.3% |
| Memory size | 111.7 KiB |
Length
| Max length | 100 |
|---|---|
| Median length | 81 |
| Mean length | 59.68234065 |
| Min length | 3 |
Characters and Unicode
| Total characters | 57116 |
|---|---|
| Distinct characters | 86 |
| Distinct categories | 13 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 3 ? |
Unique
| Unique | 941 ? |
|---|---|
| Unique (%) | 98.3% |
Sample
| 1st row | Produce Meals for low income and elderly |
|---|---|
| 2nd row | design of initial support design of geotechnical instrumentation, onsite construction management |
| 3rd row | 024 Inc sold wax melts to the customer Tamara Wilson |
| 4th row | Janitorial. |
| 5th row | Home Health Care |
| Value | Count | Frequency (%) |
| and | 444 | 5.5% |
| of | 175 | 2.2% |
| the | 169 | 2.1% |
| for | 166 | 2.1% |
| services | 133 | 1.7% |
| to | 127 | 1.6% |
| 104 | 1.3% | |
| a | 90 | 1.1% |
| provided | 84 | 1.0% |
| design | 69 | 0.9% |
| Other values (2766) | 6484 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7079 | 12.4% | |
| e | 4996 | 8.7% |
| n | 3744 | 6.6% |
| i | 3710 | 6.5% |
| a | 3579 | 6.3% |
| t | 3437 | 6.0% |
| o | 3342 | 5.9% |
| r | 3254 | 5.7% |
| s | 2888 | 5.1% |
| l | 2096 | 3.7% |
| Other values (76) | 18991 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 43564 | |
| Space Separator | 7079 | 12.4% |
| Uppercase Letter | 4495 | 7.9% |
| Other Punctuation | 1303 | 2.3% |
| Decimal Number | 410 | 0.7% |
| Dash Punctuation | 106 | 0.2% |
| Control | 73 | 0.1% |
| Open Punctuation | 40 | 0.1% |
| Close Punctuation | 33 | 0.1% |
| Final Punctuation | 5 | < 0.1% |
| Other values (3) | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4996 | |
| n | 3744 | 8.6% |
| i | 3710 | 8.5% |
| a | 3579 | 8.2% |
| t | 3437 | 7.9% |
| o | 3342 | 7.7% |
| r | 3254 | 7.5% |
| s | 2888 | 6.6% |
| l | 2096 | 4.8% |
| d | 2043 | 4.7% |
| Other values (17) | 10475 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 435 | 9.7% |
| P | 386 | 8.6% |
| A | 377 | 8.4% |
| C | 366 | 8.1% |
| R | 323 | 7.2% |
| E | 294 | 6.5% |
| I | 288 | 6.4% |
| T | 256 | 5.7% |
| D | 238 | 5.3% |
| O | 205 | 4.6% |
| Other values (16) | 1327 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 694 | |
| . | 404 | |
| & | 62 | 4.8% |
| / | 59 | 4.5% |
| ' | 33 | 2.5% |
| : | 22 | 1.7% |
| ; | 8 | 0.6% |
| # | 6 | 0.5% |
| • | 5 | 0.4% |
| ? | 4 | 0.3% |
| Other values (3) | 6 | 0.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 90 | |
| 1 | 81 | |
| 2 | 71 | |
| 3 | 41 | |
| 5 | 36 | 8.8% |
| 4 | 28 | 6.8% |
| 7 | 20 | 4.9% |
| 8 | 15 | 3.7% |
| 6 | 15 | 3.7% |
| 9 | 13 | 3.2% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 4 | |
| ” | 1 | 20.0% |
Space Separator
| Value | Count | Frequency (%) |
| 7079 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 106 |
Control
| Value | Count | Frequency (%) |
| 73 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 40 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 33 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 4 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 3 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 48059 | |
| Common | 9057 | 15.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 4996 | 10.4% |
| n | 3744 | 7.8% |
| i | 3710 | 7.7% |
| a | 3579 | 7.4% |
| t | 3437 | 7.2% |
| o | 3342 | 7.0% |
| r | 3254 | 6.8% |
| s | 2888 | 6.0% |
| l | 2096 | 4.4% |
| d | 2043 | 4.3% |
| Other values (43) | 14970 |
Common
| Value | Count | Frequency (%) |
| 7079 | ||
| , | 694 | 7.7% |
| . | 404 | 4.5% |
| - | 106 | 1.2% |
| 0 | 90 | 1.0% |
| 1 | 81 | 0.9% |
| 73 | 0.8% | |
| 2 | 71 | 0.8% |
| & | 62 | 0.7% |
| / | 59 | 0.7% |
| Other values (23) | 338 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 57101 | |
| Punctuation | 11 | < 0.1% |
| None | 4 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7079 | 12.4% | |
| e | 4996 | 8.7% |
| n | 3744 | 6.6% |
| i | 3710 | 6.5% |
| a | 3579 | 6.3% |
| t | 3437 | 6.0% |
| o | 3342 | 5.9% |
| r | 3254 | 5.7% |
| s | 2888 | 5.1% |
| l | 2096 | 3.7% |
| Other values (71) | 18976 |
Punctuation
| Value | Count | Frequency (%) |
| • | 5 | |
| ’ | 4 | |
| “ | 1 | 9.1% |
| ” | 1 | 9.1% |
None
| Value | Count | Frequency (%) |
| ç | 4 |
MISSING 
| Distinct | 766 |
|---|---|
| Distinct (%) | 97.1% |
| Missing | 211 |
| Missing (%) | 21.1% |
| Memory size | 66.3 KiB |
Length
| Max length | 100 |
|---|---|
| Median length | 48 |
| Mean length | 20.02281369 |
| Min length | 2 |
Characters and Unicode
| Total characters | 15798 |
|---|---|
| Distinct characters | 78 |
| Distinct categories | 11 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 748 ? |
|---|---|
| Unique (%) | 94.8% |
Sample
| 1st row | Danielle Warr |
|---|---|
| 2nd row | Liddso |
| 3rd row | Helena Zak |
| 4th row | Truline Construction Services Inc |
| 5th row | Glito Corp |
| Value | Count | Frequency (%) |
| inc | 67 | 2.8% |
| of | 63 | 2.7% |
| construction | 51 | 2.2% |
| nyc | 42 | 1.8% |
| 40 | 1.7% | |
| llc | 39 | 1.7% |
| new | 32 | 1.4% |
| york | 28 | 1.2% |
| corp | 22 | 0.9% |
| and | 22 | 0.9% |
| Other values (1229) | 1957 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1578 | 10.0% | |
| e | 1215 | 7.7% |
| n | 1027 | 6.5% |
| o | 974 | 6.2% |
| r | 912 | 5.8% |
| t | 897 | 5.7% |
| i | 870 | 5.5% |
| a | 830 | 5.3% |
| s | 554 | 3.5% |
| l | 533 | 3.4% |
| Other values (68) | 6408 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10527 | |
| Uppercase Letter | 3347 | 21.2% |
| Space Separator | 1578 | 10.0% |
| Other Punctuation | 191 | 1.2% |
| Decimal Number | 103 | 0.7% |
| Dash Punctuation | 24 | 0.2% |
| Close Punctuation | 12 | 0.1% |
| Open Punctuation | 12 | 0.1% |
| Final Punctuation | 2 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1215 | |
| n | 1027 | |
| o | 974 | |
| r | 912 | |
| t | 897 | 8.5% |
| i | 870 | 8.3% |
| a | 830 | 7.9% |
| s | 554 | 5.3% |
| l | 533 | 5.1% |
| c | 470 | 4.5% |
| Other values (16) | 2245 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 476 | |
| S | 266 | 7.9% |
| A | 234 | 7.0% |
| N | 225 | 6.7% |
| M | 171 | 5.1% |
| I | 167 | 5.0% |
| L | 164 | 4.9% |
| D | 160 | 4.8% |
| P | 159 | 4.8% |
| R | 154 | 4.6% |
| Other values (16) | 1171 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 24 | |
| 1 | 23 | |
| 0 | 11 | |
| 5 | 9 | 8.7% |
| 3 | 8 | 7.8% |
| 7 | 7 | 6.8% |
| 4 | 6 | 5.8% |
| 6 | 6 | 5.8% |
| 9 | 5 | 4.9% |
| 8 | 4 | 3.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 82 | |
| , | 42 | |
| & | 37 | |
| / | 21 | 11.0% |
| ' | 7 | 3.7% |
| : | 1 | 0.5% |
| ! | 1 | 0.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 23 | |
| – | 1 | 4.2% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 1 | |
| ” | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1578 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 12 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 12 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13874 | |
| Common | 1924 | 12.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1215 | 8.8% |
| n | 1027 | 7.4% |
| o | 974 | 7.0% |
| r | 912 | 6.6% |
| t | 897 | 6.5% |
| i | 870 | 6.3% |
| a | 830 | 6.0% |
| s | 554 | 4.0% |
| l | 533 | 3.8% |
| C | 476 | 3.4% |
| Other values (42) | 5586 |
Common
| Value | Count | Frequency (%) |
| 1578 | ||
| . | 82 | 4.3% |
| , | 42 | 2.2% |
| & | 37 | 1.9% |
| 2 | 24 | 1.2% |
| 1 | 23 | 1.2% |
| - | 23 | 1.2% |
| / | 21 | 1.1% |
| ) | 12 | 0.6% |
| ( | 12 | 0.6% |
| Other values (16) | 70 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15794 | |
| Punctuation | 4 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1578 | 10.0% | |
| e | 1215 | 7.7% |
| n | 1027 | 6.5% |
| o | 974 | 6.2% |
| r | 912 | 5.8% |
| t | 897 | 5.7% |
| i | 870 | 5.5% |
| a | 830 | 5.3% |
| s | 554 | 3.5% |
| l | 533 | 3.4% |
| Other values (64) | 6404 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 1 | |
| “ | 1 | |
| ” | 1 | |
| – | 1 |
value_of_contract_job_exp_2
Text
MISSING 
| Distinct | 541 |
|---|---|
| Distinct (%) | 70.0% |
| Missing | 227 |
| Missing (%) | 22.7% |
| Memory size | 54.1 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 5.058214748 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3910 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 455 ? |
|---|---|
| Unique (%) | 58.9% |
Sample
| 1st row | 10 |
|---|---|
| 2nd row | 200000 |
| 3rd row | 100 |
| 4th row | 3938 |
| 5th row | 7000 |
| Value | Count | Frequency (%) |
| 5000 | 17 | 2.2% |
| 10000 | 13 | 1.7% |
| 20000 | 10 | 1.3% |
| 40000 | 8 | 1.0% |
| 50000 | 8 | 1.0% |
| 100000 | 8 | 1.0% |
| 500 | 7 | 0.9% |
| 4000 | 7 | 0.9% |
| 100 | 7 | 0.9% |
| 30000 | 7 | 0.9% |
| Other values (531) | 681 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1640 | |
| 1 | 377 | 9.6% |
| 5 | 367 | 9.4% |
| 2 | 289 | 7.4% |
| 3 | 239 | 6.1% |
| 4 | 232 | 5.9% |
| 7 | 202 | 5.2% |
| 8 | 176 | 4.5% |
| 9 | 176 | 4.5% |
| 6 | 169 | 4.3% |
| Other values (2) | 43 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3867 | |
| Other Punctuation | 42 | 1.1% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1640 | |
| 1 | 377 | 9.7% |
| 5 | 367 | 9.5% |
| 2 | 289 | 7.5% |
| 3 | 239 | 6.2% |
| 4 | 232 | 6.0% |
| 7 | 202 | 5.2% |
| 8 | 176 | 4.6% |
| 9 | 176 | 4.6% |
| 6 | 169 | 4.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 42 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3910 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1640 | |
| 1 | 377 | 9.6% |
| 5 | 367 | 9.4% |
| 2 | 289 | 7.4% |
| 3 | 239 | 6.1% |
| 4 | 232 | 5.9% |
| 7 | 202 | 5.2% |
| 8 | 176 | 4.5% |
| 9 | 176 | 4.5% |
| 6 | 169 | 4.3% |
| Other values (2) | 43 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3910 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1640 | |
| 1 | 377 | 9.6% |
| 5 | 367 | 9.4% |
| 2 | 289 | 7.4% |
| 3 | 239 | 6.1% |
| 4 | 232 | 5.9% |
| 7 | 202 | 5.2% |
| 8 | 176 | 4.5% |
| 9 | 176 | 4.5% |
| 6 | 169 | 4.3% |
| Other values (2) | 43 | 1.1% |
percent_self_performed_job_exp_2
Text
MISSING 
| Distinct | 23 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 262 |
| Missing (%) | 26.2% |
| Memory size | 51.4 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.841463415 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2097 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | 100 |
|---|---|
| 2nd row | 100 |
| 3rd row | 100 |
| 4th row | 100 |
| 5th row | 100 |
| Value | Count | Frequency (%) |
| 100 | 632 | |
| 50 | 11 | 1.5% |
| 80 | 10 | 1.4% |
| 20 | 10 | 1.4% |
| 75 | 9 | 1.2% |
| 0 | 9 | 1.2% |
| 90 | 9 | 1.2% |
| 85 | 7 | 0.9% |
| 95 | 7 | 0.9% |
| 10 | 6 | 0.8% |
| Other values (13) | 28 | 3.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1331 | |
| 1 | 640 | |
| 5 | 44 | 2.1% |
| 8 | 19 | 0.9% |
| 2 | 18 | 0.9% |
| 9 | 16 | 0.8% |
| 7 | 12 | 0.6% |
| 3 | 9 | 0.4% |
| 6 | 8 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2097 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1331 | |
| 1 | 640 | |
| 5 | 44 | 2.1% |
| 8 | 19 | 0.9% |
| 2 | 18 | 0.9% |
| 9 | 16 | 0.8% |
| 7 | 12 | 0.6% |
| 3 | 9 | 0.4% |
| 6 | 8 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2097 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1331 | |
| 1 | 640 | |
| 5 | 44 | 2.1% |
| 8 | 19 | 0.9% |
| 2 | 18 | 0.9% |
| 9 | 16 | 0.8% |
| 7 | 12 | 0.6% |
| 3 | 9 | 0.4% |
| 6 | 8 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2097 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1331 | |
| 1 | 640 | |
| 5 | 44 | 2.1% |
| 8 | 19 | 0.9% |
| 2 | 18 | 0.9% |
| 9 | 16 | 0.8% |
| 7 | 12 | 0.6% |
| 3 | 9 | 0.4% |
| 6 | 8 | 0.4% |
MISSING 
| Distinct | 596 |
|---|---|
| Distinct (%) | 75.5% |
| Missing | 211 |
| Missing (%) | 21.1% |
| Memory size | 68.4 KiB |
Length
| Max length | 23 |
|---|---|
| Median length | 23 |
| Mean length | 23 |
| Min length | 23 |
Characters and Unicode
| Total characters | 18147 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 475 ? |
|---|---|
| Unique (%) | 60.2% |
Sample
| 1st row | 2020-09-23T00:00:00.000 |
|---|---|
| 2nd row | 2018-01-01T00:00:00.000 |
| 3rd row | 2021-06-08T00:00:00.000 |
| 4th row | 2019-03-12T00:00:00.000 |
| 5th row | 2020-06-27T00:00:00.000 |
| Value | Count | Frequency (%) |
| 2018-01-01t00:00:00.000 | 10 | 1.3% |
| 2019-02-01t00:00:00.000 | 5 | 0.6% |
| 2013-01-01t00:00:00.000 | 5 | 0.6% |
| 2015-01-01t00:00:00.000 | 5 | 0.6% |
| 2023-01-01t00:00:00.000 | 5 | 0.6% |
| 2021-12-31t00:00:00.000 | 5 | 0.6% |
| 2018-12-01t00:00:00.000 | 5 | 0.6% |
| 2017-01-01t00:00:00.000 | 4 | 0.5% |
| 2019-01-01t00:00:00.000 | 4 | 0.5% |
| 2021-09-01t00:00:00.000 | 4 | 0.5% |
| Other values (586) | 737 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 9135 | |
| 2 | 1644 | 9.1% |
| - | 1578 | 8.7% |
| : | 1578 | 8.7% |
| 1 | 1287 | 7.1% |
| T | 789 | 4.3% |
| . | 789 | 4.3% |
| 9 | 249 | 1.4% |
| 3 | 248 | 1.4% |
| 8 | 236 | 1.3% |
| Other values (4) | 614 | 3.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 13413 | |
| Other Punctuation | 2367 | 13.0% |
| Dash Punctuation | 1578 | 8.7% |
| Uppercase Letter | 789 | 4.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 9135 | |
| 2 | 1644 | 12.3% |
| 1 | 1287 | 9.6% |
| 9 | 249 | 1.9% |
| 3 | 248 | 1.8% |
| 8 | 236 | 1.8% |
| 7 | 182 | 1.4% |
| 5 | 159 | 1.2% |
| 6 | 146 | 1.1% |
| 4 | 127 | 0.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1578 | |
| . | 789 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1578 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 789 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 17358 | |
| Latin | 789 | 4.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 9135 | |
| 2 | 1644 | 9.5% |
| - | 1578 | 9.1% |
| : | 1578 | 9.1% |
| 1 | 1287 | 7.4% |
| . | 789 | 4.5% |
| 9 | 249 | 1.4% |
| 3 | 248 | 1.4% |
| 8 | 236 | 1.4% |
| 7 | 182 | 1.0% |
| Other values (3) | 432 | 2.5% |
Latin
| Value | Count | Frequency (%) |
| T | 789 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18147 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 9135 | |
| 2 | 1644 | 9.1% |
| - | 1578 | 8.7% |
| : | 1578 | 8.7% |
| 1 | 1287 | 7.1% |
| T | 789 | 4.3% |
| . | 789 | 4.3% |
| 9 | 249 | 1.4% |
| 3 | 248 | 1.4% |
| 8 | 236 | 1.3% |
| Other values (4) | 614 | 3.4% |
description_of_work_job_exp_2
Text
MISSING 
| Distinct | 782 |
|---|---|
| Distinct (%) | 99.1% |
| Missing | 211 |
| Missing (%) | 21.1% |
| Memory size | 95.2 KiB |
Length
| Max length | 100 |
|---|---|
| Median length | 79 |
| Mean length | 56.14321926 |
| Min length | 2 |
Characters and Unicode
| Total characters | 44297 |
|---|---|
| Distinct characters | 90 |
| Distinct categories | 13 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 4 ? |
Unique
| Unique | 777 ? |
|---|---|
| Unique (%) | 98.5% |
Sample
| 1st row | 024 Inc sold wax melts to the customer Danielle Warr |
|---|---|
| 2nd row | Building maintenance |
| 3rd row | Home Health Care |
| 4th row | Provided construction management consulting services for converting a parking structure into the cha |
| 5th row | Design consultation, construction documents, city agency submission and construction progress inspec |
| Value | Count | Frequency (%) |
| and | 346 | 5.5% |
| of | 144 | 2.3% |
| for | 120 | 1.9% |
| the | 114 | 1.8% |
| to | 106 | 1.7% |
| services | 93 | 1.5% |
| 88 | 1.4% | |
| a | 67 | 1.1% |
| provided | 54 | 0.9% |
| design | 51 | 0.8% |
| Other values (2360) | 5104 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5474 | 12.4% | |
| e | 3804 | 8.6% |
| n | 3003 | 6.8% |
| i | 2902 | 6.6% |
| a | 2796 | 6.3% |
| t | 2741 | 6.2% |
| o | 2578 | 5.8% |
| r | 2468 | 5.6% |
| s | 2149 | 4.9% |
| l | 1637 | 3.7% |
| Other values (80) | 14745 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 33616 | |
| Space Separator | 5474 | 12.4% |
| Uppercase Letter | 3742 | 8.4% |
| Other Punctuation | 935 | 2.1% |
| Decimal Number | 321 | 0.7% |
| Dash Punctuation | 84 | 0.2% |
| Control | 61 | 0.1% |
| Open Punctuation | 26 | 0.1% |
| Close Punctuation | 24 | 0.1% |
| Final Punctuation | 7 | < 0.1% |
| Other values (3) | 7 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3804 | |
| n | 3003 | 8.9% |
| i | 2902 | 8.6% |
| a | 2796 | 8.3% |
| t | 2741 | 8.2% |
| o | 2578 | 7.7% |
| r | 2468 | 7.3% |
| s | 2149 | 6.4% |
| l | 1637 | 4.9% |
| d | 1463 | 4.4% |
| Other values (17) | 8075 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 370 | 9.9% |
| P | 316 | 8.4% |
| A | 311 | 8.3% |
| C | 286 | 7.6% |
| E | 260 | 6.9% |
| I | 257 | 6.9% |
| R | 242 | 6.5% |
| T | 217 | 5.8% |
| D | 209 | 5.6% |
| N | 163 | 4.4% |
| Other values (16) | 1111 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 454 | |
| . | 330 | |
| & | 55 | 5.9% |
| / | 39 | 4.2% |
| : | 20 | 2.1% |
| ' | 19 | 2.0% |
| ; | 5 | 0.5% |
| # | 4 | 0.4% |
| ? | 4 | 0.4% |
| • | 2 | 0.2% |
| Other values (3) | 3 | 0.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 92 | |
| 1 | 58 | |
| 2 | 54 | |
| 5 | 27 | 8.4% |
| 3 | 25 | 7.8% |
| 4 | 22 | 6.9% |
| 6 | 15 | 4.7% |
| 8 | 12 | 3.7% |
| 9 | 11 | 3.4% |
| 7 | 5 | 1.6% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 2 | |
| > | 1 | |
| < | 1 | |
| ~ | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 82 | |
| – | 2 | 2.4% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 5 | |
| ” | 2 | 28.6% |
Space Separator
| Value | Count | Frequency (%) |
| 5474 |
Control
| Value | Count | Frequency (%) |
| 61 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 26 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 24 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 1 |
Other Symbol
| Value | Count | Frequency (%) |
| ™ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 37358 | |
| Common | 6939 | 15.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3804 | 10.2% |
| n | 3003 | 8.0% |
| i | 2902 | 7.8% |
| a | 2796 | 7.5% |
| t | 2741 | 7.3% |
| o | 2578 | 6.9% |
| r | 2468 | 6.6% |
| s | 2149 | 5.8% |
| l | 1637 | 4.4% |
| d | 1463 | 3.9% |
| Other values (43) | 11817 |
Common
| Value | Count | Frequency (%) |
| 5474 | ||
| , | 454 | 6.5% |
| . | 330 | 4.8% |
| 0 | 92 | 1.3% |
| - | 82 | 1.2% |
| 61 | 0.9% | |
| 1 | 58 | 0.8% |
| & | 55 | 0.8% |
| 2 | 54 | 0.8% |
| / | 39 | 0.6% |
| Other values (27) | 240 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 44283 | |
| Punctuation | 11 | < 0.1% |
| None | 2 | < 0.1% |
| Letterlike Symbols | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5474 | 12.4% | |
| e | 3804 | 8.6% |
| n | 3003 | 6.8% |
| i | 2902 | 6.6% |
| a | 2796 | 6.3% |
| t | 2741 | 6.2% |
| o | 2578 | 5.8% |
| r | 2468 | 5.6% |
| s | 2149 | 4.9% |
| l | 1637 | 3.7% |
| Other values (73) | 14731 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 5 | |
| – | 2 | 18.2% |
| • | 2 | 18.2% |
| ” | 2 | 18.2% |
None
| Value | Count | Frequency (%) |
| é | 1 | |
| ¿ | 1 |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 1 |
MISSING 
| Distinct | 657 |
|---|---|
| Distinct (%) | 97.8% |
| Missing | 328 |
| Missing (%) | 32.8% |
| Memory size | 60.8 KiB |
Length
| Max length | 75 |
|---|---|
| Median length | 44 |
| Mean length | 19.76339286 |
| Min length | 3 |
Characters and Unicode
| Total characters | 13281 |
|---|---|
| Distinct characters | 73 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 645 ? |
|---|---|
| Unique (%) | 96.0% |
Sample
| 1st row | Daidralyn Wood |
|---|---|
| 2nd row | Maimonides Medical Center |
| 3rd row | Rhea King |
| 4th row | Irene Gottlieb |
| 5th row | Robert Leo |
| Value | Count | Frequency (%) |
| construction | 53 | 2.7% |
| of | 51 | 2.6% |
| inc | 46 | 2.3% |
| 44 | 2.2% | |
| nyc | 39 | 2.0% |
| llc | 25 | 1.3% |
| corp | 25 | 1.3% |
| new | 23 | 1.2% |
| and | 19 | 1.0% |
| york | 18 | 0.9% |
| Other values (1136) | 1652 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1328 | 10.0% | |
| e | 968 | 7.3% |
| n | 906 | 6.8% |
| o | 854 | 6.4% |
| t | 833 | 6.3% |
| r | 752 | 5.7% |
| a | 742 | 5.6% |
| i | 730 | 5.5% |
| s | 445 | 3.4% |
| C | 431 | 3.2% |
| Other values (63) | 5292 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8824 | |
| Uppercase Letter | 2856 | 21.5% |
| Space Separator | 1328 | 10.0% |
| Other Punctuation | 167 | 1.3% |
| Decimal Number | 65 | 0.5% |
| Dash Punctuation | 26 | 0.2% |
| Open Punctuation | 7 | 0.1% |
| Close Punctuation | 7 | 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 968 | |
| n | 906 | |
| o | 854 | |
| t | 833 | |
| r | 752 | |
| a | 742 | |
| i | 730 | 8.3% |
| s | 445 | 5.0% |
| l | 390 | 4.4% |
| c | 362 | 4.1% |
| Other values (16) | 1842 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 431 | |
| S | 249 | 8.7% |
| A | 202 | 7.1% |
| N | 183 | 6.4% |
| M | 159 | 5.6% |
| D | 149 | 5.2% |
| I | 137 | 4.8% |
| L | 135 | 4.7% |
| R | 125 | 4.4% |
| P | 123 | 4.3% |
| Other values (16) | 963 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 13 | |
| 4 | 8 | |
| 3 | 8 | |
| 5 | 8 | |
| 1 | 7 | |
| 9 | 6 | |
| 6 | 6 | |
| 0 | 3 | 4.6% |
| 8 | 3 | 4.6% |
| 7 | 3 | 4.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 70 | |
| & | 37 | |
| , | 35 | |
| / | 14 | 8.4% |
| ' | 11 | 6.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 25 | |
| – | 1 | 3.8% |
Space Separator
| Value | Count | Frequency (%) |
| 1328 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 7 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 7 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11680 | |
| Common | 1601 | 12.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 968 | 8.3% |
| n | 906 | 7.8% |
| o | 854 | 7.3% |
| t | 833 | 7.1% |
| r | 752 | 6.4% |
| a | 742 | 6.4% |
| i | 730 | 6.2% |
| s | 445 | 3.8% |
| C | 431 | 3.7% |
| l | 390 | 3.3% |
| Other values (42) | 4629 |
Common
| Value | Count | Frequency (%) |
| 1328 | ||
| . | 70 | 4.4% |
| & | 37 | 2.3% |
| , | 35 | 2.2% |
| - | 25 | 1.6% |
| / | 14 | 0.9% |
| 2 | 13 | 0.8% |
| ' | 11 | 0.7% |
| 4 | 8 | 0.5% |
| 3 | 8 | 0.5% |
| Other values (11) | 52 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13280 | |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1328 | 10.0% | |
| e | 968 | 7.3% |
| n | 906 | 6.8% |
| o | 854 | 6.4% |
| t | 833 | 6.3% |
| r | 752 | 5.7% |
| a | 742 | 5.6% |
| i | 730 | 5.5% |
| s | 445 | 3.4% |
| C | 431 | 3.2% |
| Other values (62) | 5291 |
Punctuation
| Value | Count | Frequency (%) |
| – | 1 |
value_of_contract_job_exp_3
Text
MISSING 
| Distinct | 472 |
|---|---|
| Distinct (%) | 72.0% |
| Missing | 344 |
| Missing (%) | 34.4% |
| Memory size | 50.7 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 5.150914634 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3379 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 393 ? |
|---|---|
| Unique (%) | 59.9% |
Sample
| 1st row | 10 |
|---|---|
| 2nd row | 150000 |
| 3rd row | 100 |
| 4th row | 4665 |
| 5th row | 8000 |
| Value | Count | Frequency (%) |
| 20000 | 9 | 1.4% |
| 5000 | 9 | 1.4% |
| 100000 | 9 | 1.4% |
| 2000 | 9 | 1.4% |
| 150000 | 8 | 1.2% |
| 25000 | 7 | 1.1% |
| 100 | 7 | 1.1% |
| 400000 | 7 | 1.1% |
| 15000 | 6 | 0.9% |
| 30000 | 6 | 0.9% |
| Other values (462) | 579 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1470 | |
| 1 | 320 | 9.5% |
| 5 | 313 | 9.3% |
| 2 | 279 | 8.3% |
| 3 | 180 | 5.3% |
| 6 | 177 | 5.2% |
| 4 | 164 | 4.9% |
| 7 | 162 | 4.8% |
| 8 | 149 | 4.4% |
| 9 | 134 | 4.0% |
| Other values (2) | 31 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3348 | |
| Other Punctuation | 30 | 0.9% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1470 | |
| 1 | 320 | 9.6% |
| 5 | 313 | 9.3% |
| 2 | 279 | 8.3% |
| 3 | 180 | 5.4% |
| 6 | 177 | 5.3% |
| 4 | 164 | 4.9% |
| 7 | 162 | 4.8% |
| 8 | 149 | 4.5% |
| 9 | 134 | 4.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 30 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3379 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1470 | |
| 1 | 320 | 9.5% |
| 5 | 313 | 9.3% |
| 2 | 279 | 8.3% |
| 3 | 180 | 5.3% |
| 6 | 177 | 5.2% |
| 4 | 164 | 4.9% |
| 7 | 162 | 4.8% |
| 8 | 149 | 4.4% |
| 9 | 134 | 4.0% |
| Other values (2) | 31 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3379 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1470 | |
| 1 | 320 | 9.5% |
| 5 | 313 | 9.3% |
| 2 | 279 | 8.3% |
| 3 | 180 | 5.3% |
| 6 | 177 | 5.2% |
| 4 | 164 | 4.9% |
| 7 | 162 | 4.8% |
| 8 | 149 | 4.4% |
| 9 | 134 | 4.0% |
| Other values (2) | 31 | 0.9% |
percent_self_performed_job_exp_3
Text
MISSING 
| Distinct | 24 |
|---|---|
| Distinct (%) | 3.9% |
| Missing | 387 |
| Missing (%) | 38.7% |
| Memory size | 48.1 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.859706362 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1753 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 1.1% |
Sample
| 1st row | 100 |
|---|---|
| 2nd row | 100 |
| 3rd row | 100 |
| 4th row | 100 |
| 5th row | 100 |
| Value | Count | Frequency (%) |
| 100 | 534 | |
| 80 | 9 | 1.5% |
| 90 | 9 | 1.5% |
| 75 | 7 | 1.1% |
| 50 | 7 | 1.1% |
| 0 | 6 | 1.0% |
| 70 | 5 | 0.8% |
| 85 | 4 | 0.7% |
| 40 | 4 | 0.7% |
| 10 | 4 | 0.7% |
| Other values (14) | 24 | 3.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1119 | |
| 1 | 540 | |
| 5 | 28 | 1.6% |
| 9 | 15 | 0.9% |
| 8 | 15 | 0.9% |
| 7 | 13 | 0.7% |
| 3 | 7 | 0.4% |
| 6 | 6 | 0.3% |
| 2 | 6 | 0.3% |
| 4 | 4 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1753 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1119 | |
| 1 | 540 | |
| 5 | 28 | 1.6% |
| 9 | 15 | 0.9% |
| 8 | 15 | 0.9% |
| 7 | 13 | 0.7% |
| 3 | 7 | 0.4% |
| 6 | 6 | 0.3% |
| 2 | 6 | 0.3% |
| 4 | 4 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1753 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1119 | |
| 1 | 540 | |
| 5 | 28 | 1.6% |
| 9 | 15 | 0.9% |
| 8 | 15 | 0.9% |
| 7 | 13 | 0.7% |
| 3 | 7 | 0.4% |
| 6 | 6 | 0.3% |
| 2 | 6 | 0.3% |
| 4 | 4 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1753 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1119 | |
| 1 | 540 | |
| 5 | 28 | 1.6% |
| 9 | 15 | 0.9% |
| 8 | 15 | 0.9% |
| 7 | 13 | 0.7% |
| 3 | 7 | 0.4% |
| 6 | 6 | 0.3% |
| 2 | 6 | 0.3% |
| 4 | 4 | 0.2% |
MISSING 
| Distinct | 519 |
|---|---|
| Distinct (%) | 77.2% |
| Missing | 328 |
| Missing (%) | 32.8% |
| Memory size | 62.9 KiB |
Length
| Max length | 23 |
|---|---|
| Median length | 23 |
| Mean length | 23 |
| Min length | 23 |
Characters and Unicode
| Total characters | 15456 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 420 ? |
|---|---|
| Unique (%) | 62.5% |
Sample
| 1st row | 2020-08-07T00:00:00.000 |
|---|---|
| 2nd row | 2018-01-01T00:00:00.000 |
| 3rd row | 2021-06-01T00:00:00.000 |
| 4th row | 2019-02-10T00:00:00.000 |
| 5th row | 2020-06-01T00:00:00.000 |
| Value | Count | Frequency (%) |
| 2018-01-01t00:00:00.000 | 10 | 1.5% |
| 2019-01-01t00:00:00.000 | 8 | 1.2% |
| 2017-01-01t00:00:00.000 | 7 | 1.0% |
| 2020-01-01t00:00:00.000 | 5 | 0.7% |
| 2014-01-01t00:00:00.000 | 5 | 0.7% |
| 2018-03-01t00:00:00.000 | 5 | 0.7% |
| 2016-01-01t00:00:00.000 | 4 | 0.6% |
| 2017-06-01t00:00:00.000 | 4 | 0.6% |
| 2016-11-01t00:00:00.000 | 4 | 0.6% |
| 2014-09-01t00:00:00.000 | 3 | 0.4% |
| Other values (509) | 617 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 7772 | |
| - | 1344 | 8.7% |
| : | 1344 | 8.7% |
| 2 | 1255 | 8.1% |
| 1 | 1226 | 7.9% |
| T | 672 | 4.3% |
| . | 672 | 4.3% |
| 8 | 214 | 1.4% |
| 3 | 194 | 1.3% |
| 9 | 187 | 1.2% |
| Other values (4) | 576 | 3.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 11424 | |
| Other Punctuation | 2016 | 13.0% |
| Dash Punctuation | 1344 | 8.7% |
| Uppercase Letter | 672 | 4.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7772 | |
| 2 | 1255 | 11.0% |
| 1 | 1226 | 10.7% |
| 8 | 214 | 1.9% |
| 3 | 194 | 1.7% |
| 9 | 187 | 1.6% |
| 7 | 164 | 1.4% |
| 6 | 146 | 1.3% |
| 5 | 135 | 1.2% |
| 4 | 131 | 1.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1344 | |
| . | 672 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1344 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 672 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 14784 | |
| Latin | 672 | 4.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 7772 | |
| - | 1344 | 9.1% |
| : | 1344 | 9.1% |
| 2 | 1255 | 8.5% |
| 1 | 1226 | 8.3% |
| . | 672 | 4.5% |
| 8 | 214 | 1.4% |
| 3 | 194 | 1.3% |
| 9 | 187 | 1.3% |
| 7 | 164 | 1.1% |
| Other values (3) | 412 | 2.8% |
Latin
| Value | Count | Frequency (%) |
| T | 672 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15456 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 7772 | |
| - | 1344 | 8.7% |
| : | 1344 | 8.7% |
| 2 | 1255 | 8.1% |
| 1 | 1226 | 7.9% |
| T | 672 | 4.3% |
| . | 672 | 4.3% |
| 8 | 214 | 1.4% |
| 3 | 194 | 1.3% |
| 9 | 187 | 1.2% |
| Other values (4) | 576 | 3.7% |
description_of_work_job_exp_3
Text
MISSING 
| Distinct | 669 |
|---|---|
| Distinct (%) | 99.6% |
| Missing | 328 |
| Missing (%) | 32.8% |
| Memory size | 87.1 KiB |
Length
| Max length | 100 |
|---|---|
| Median length | 82 |
| Mean length | 58.28869048 |
| Min length | 3 |
Characters and Unicode
| Total characters | 39170 |
|---|---|
| Distinct characters | 90 |
| Distinct categories | 12 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 667 ? |
|---|---|
| Unique (%) | 99.3% |
Sample
| 1st row | 024 Inc sold wax melts to the customer Daidralyn Wood |
|---|---|
| 2nd row | Building maintenance, janitorial. |
| 3rd row | Home Health Care |
| 4th row | Provided construction management services. |
| 5th row | Design consultation, Construction documentation, city agency application and construction progress i |
| Value | Count | Frequency (%) |
| and | 314 | 5.7% |
| of | 141 | 2.5% |
| for | 109 | 2.0% |
| the | 98 | 1.8% |
| 86 | 1.6% | |
| to | 83 | 1.5% |
| services | 79 | 1.4% |
| design | 55 | 1.0% |
| in | 53 | 1.0% |
| a | 46 | 0.8% |
| Other values (2131) | 4469 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4833 | 12.3% | |
| e | 3328 | 8.5% |
| n | 2650 | 6.8% |
| i | 2623 | 6.7% |
| t | 2416 | 6.2% |
| a | 2386 | 6.1% |
| o | 2375 | 6.1% |
| r | 2253 | 5.8% |
| s | 1901 | 4.9% |
| l | 1456 | 3.7% |
| Other values (80) | 12949 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 29919 | |
| Space Separator | 4833 | 12.3% |
| Uppercase Letter | 3063 | 7.8% |
| Other Punctuation | 887 | 2.3% |
| Decimal Number | 289 | 0.7% |
| Dash Punctuation | 70 | 0.2% |
| Control | 52 | 0.1% |
| Open Punctuation | 25 | 0.1% |
| Close Punctuation | 20 | 0.1% |
| Final Punctuation | 6 | < 0.1% |
| Other values (2) | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3328 | |
| n | 2650 | 8.9% |
| i | 2623 | 8.8% |
| t | 2416 | 8.1% |
| a | 2386 | 8.0% |
| o | 2375 | 7.9% |
| r | 2253 | 7.5% |
| s | 1901 | 6.4% |
| l | 1456 | 4.9% |
| d | 1308 | 4.4% |
| Other values (16) | 7223 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 291 | 9.5% |
| A | 253 | 8.3% |
| P | 249 | 8.1% |
| C | 240 | 7.8% |
| E | 219 | 7.1% |
| I | 209 | 6.8% |
| R | 199 | 6.5% |
| T | 197 | 6.4% |
| D | 182 | 5.9% |
| N | 142 | 4.6% |
| Other values (16) | 882 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 443 | |
| . | 279 | |
| & | 58 | 6.5% |
| / | 48 | 5.4% |
| : | 17 | 1.9% |
| ; | 14 | 1.6% |
| ' | 12 | 1.4% |
| • | 4 | 0.5% |
| # | 4 | 0.5% |
| " | 3 | 0.3% |
| Other values (3) | 5 | 0.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 72 | |
| 2 | 52 | |
| 1 | 48 | |
| 5 | 25 | 8.7% |
| 4 | 24 | 8.3% |
| 3 | 23 | 8.0% |
| 6 | 15 | 5.2% |
| 7 | 12 | 4.2% |
| 9 | 10 | 3.5% |
| 8 | 8 | 2.8% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 2 | |
| = | 1 | |
| < | 1 | |
| > | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 68 | |
| — | 1 | 1.4% |
| – | 1 | 1.4% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 24 | |
| [ | 1 | 4.0% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 5 | |
| ” | 1 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| 4833 |
Control
| Value | Count | Frequency (%) |
| 52 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 20 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 32982 | |
| Common | 6188 | 15.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3328 | 10.1% |
| n | 2650 | 8.0% |
| i | 2623 | 8.0% |
| t | 2416 | 7.3% |
| a | 2386 | 7.2% |
| o | 2375 | 7.2% |
| r | 2253 | 6.8% |
| s | 1901 | 5.8% |
| l | 1456 | 4.4% |
| d | 1308 | 4.0% |
| Other values (42) | 10286 |
Common
| Value | Count | Frequency (%) |
| 4833 | ||
| , | 443 | 7.2% |
| . | 279 | 4.5% |
| 0 | 72 | 1.2% |
| - | 68 | 1.1% |
| & | 58 | 0.9% |
| 52 | 0.8% | |
| 2 | 52 | 0.8% |
| 1 | 48 | 0.8% |
| / | 48 | 0.8% |
| Other values (28) | 235 | 3.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 39158 | |
| Punctuation | 12 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4833 | 12.3% | |
| e | 3328 | 8.5% |
| n | 2650 | 6.8% |
| i | 2623 | 6.7% |
| t | 2416 | 6.2% |
| a | 2386 | 6.1% |
| o | 2375 | 6.1% |
| r | 2253 | 5.8% |
| s | 1901 | 4.9% |
| l | 1456 | 3.7% |
| Other values (75) | 12937 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 5 | |
| • | 4 | |
| ” | 1 | 8.3% |
| — | 1 | 8.3% |
| – | 1 | 8.3% |
capacity_building_programs
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 1000 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 7.9 KiB |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 58.6 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.838 |
| Min length | 2 |
Characters and Unicode
| Total characters | 2838 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Yes |
|---|---|
| 2nd row | Yes |
| 3rd row | No |
| 4th row | Yes |
| 5th row | Yes |
| Value | Count | Frequency (%) |
| yes | 838 | |
| no | 162 | 16.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| Y | 838 | |
| e | 838 | |
| s | 838 | |
| N | 162 | 5.7% |
| o | 162 | 5.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1838 | |
| Uppercase Letter | 1000 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 838 | |
| s | 838 | |
| o | 162 | 8.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 838 | |
| N | 162 | 16.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2838 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| Y | 838 | |
| e | 838 | |
| s | 838 | |
| N | 162 | 5.7% |
| o | 162 | 5.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2838 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| Y | 838 | |
| e | 838 | |
| s | 838 | |
| N | 162 | 5.7% |
| o | 162 | 5.7% |
borough
Text
MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 380 |
| Missing (%) | 38.0% |
| Memory size | 51.0 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 7.403225806 |
| Min length | 5 |
Characters and Unicode
| Total characters | 4590 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | BROOKLYN |
|---|---|
| 2nd row | BRONX |
| 3rd row | BROOKLYN |
| 4th row | BROOKLYN |
| 5th row | MANHATTAN |
| Value | Count | Frequency (%) |
| queens | 182 | |
| brooklyn | 180 | |
| manhattan | 166 | |
| bronx | 66 | 10.2% |
| staten | 26 | 4.0% |
| is | 26 | 4.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 786 | |
| A | 524 | |
| O | 426 | |
| E | 390 | 8.5% |
| T | 384 | 8.4% |
| B | 246 | 5.4% |
| R | 246 | 5.4% |
| S | 234 | 5.1% |
| Q | 182 | 4.0% |
| U | 182 | 4.0% |
| Other values (8) | 990 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4564 | |
| Space Separator | 26 | 0.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 786 | |
| A | 524 | |
| O | 426 | |
| E | 390 | |
| T | 384 | 8.4% |
| B | 246 | 5.4% |
| R | 246 | 5.4% |
| S | 234 | 5.1% |
| Q | 182 | 4.0% |
| U | 182 | 4.0% |
| Other values (7) | 964 |
Space Separator
| Value | Count | Frequency (%) |
| 26 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4564 | |
| Common | 26 | 0.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 786 | |
| A | 524 | |
| O | 426 | |
| E | 390 | |
| T | 384 | 8.4% |
| B | 246 | 5.4% |
| R | 246 | 5.4% |
| S | 234 | 5.1% |
| Q | 182 | 4.0% |
| U | 182 | 4.0% |
| Other values (7) | 964 |
Common
| Value | Count | Frequency (%) |
| 26 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4590 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 786 | |
| A | 524 | |
| O | 426 | |
| E | 390 | 8.5% |
| T | 384 | 8.4% |
| B | 246 | 5.4% |
| R | 246 | 5.4% |
| S | 234 | 5.1% |
| Q | 182 | 4.0% |
| U | 182 | 4.0% |
| Other values (8) | 990 |
latitude
Text
MISSING 
| Distinct | 597 |
|---|---|
| Distinct (%) | 96.3% |
| Missing | 380 |
| Missing (%) | 38.0% |
| Memory size | 51.9 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.888709677 |
| Min length | 6 |
Characters and Unicode
| Total characters | 5511 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 575 ? |
|---|---|
| Unique (%) | 92.7% |
Sample
| 1st row | 40.714065 |
|---|---|
| 2nd row | 40.863508 |
| 3rd row | 40.680772 |
| 4th row | 40.703905 |
| 5th row | 40.763365 |
| Value | Count | Frequency (%) |
| 40.74306 | 3 | 0.5% |
| 40.588719 | 2 | 0.3% |
| 40.815214 | 2 | 0.3% |
| 40.665271 | 2 | 0.3% |
| 40.750564 | 2 | 0.3% |
| 40.717612 | 2 | 0.3% |
| 40.71013 | 2 | 0.3% |
| 40.656142 | 2 | 0.3% |
| 40.739873 | 2 | 0.3% |
| 40.746091 | 2 | 0.3% |
| Other values (587) | 599 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 913 | |
| 0 | 897 | |
| . | 620 | |
| 7 | 576 | |
| 6 | 509 | |
| 8 | 387 | |
| 5 | 359 | 6.5% |
| 1 | 332 | 6.0% |
| 9 | 325 | 5.9% |
| 3 | 306 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4891 | |
| Other Punctuation | 620 | 11.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 913 | |
| 0 | 897 | |
| 7 | 576 | |
| 6 | 509 | |
| 8 | 387 | |
| 5 | 359 | 7.3% |
| 1 | 332 | 6.8% |
| 9 | 325 | 6.6% |
| 3 | 306 | 6.3% |
| 2 | 287 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 620 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5511 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 913 | |
| 0 | 897 | |
| . | 620 | |
| 7 | 576 | |
| 6 | 509 | |
| 8 | 387 | |
| 5 | 359 | 6.5% |
| 1 | 332 | 6.0% |
| 9 | 325 | 5.9% |
| 3 | 306 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5511 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 913 | |
| 0 | 897 | |
| . | 620 | |
| 7 | 576 | |
| 6 | 509 | |
| 8 | 387 | |
| 5 | 359 | 6.5% |
| 1 | 332 | 6.0% |
| 9 | 325 | 5.9% |
| 3 | 306 | 5.6% |
longitude
Text
MISSING 
| Distinct | 597 |
|---|---|
| Distinct (%) | 96.3% |
| Missing | 380 |
| Missing (%) | 38.0% |
| Memory size | 52.5 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.896774194 |
| Min length | 8 |
Characters and Unicode
| Total characters | 6136 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 575 ? |
|---|---|
| Unique (%) | 92.7% |
Sample
| 1st row | -73.960252 |
|---|---|
| 2nd row | -73.821595 |
| 3rd row | -73.962817 |
| 4th row | -73.928988 |
| 5th row | -73.994509 |
| Value | Count | Frequency (%) |
| 73.935652 | 3 | 0.5% |
| 73.991034 | 2 | 0.3% |
| 73.979941 | 2 | 0.3% |
| 73.936906 | 2 | 0.3% |
| 73.85825 | 2 | 0.3% |
| 73.951958 | 2 | 0.3% |
| 73.945718 | 2 | 0.3% |
| 73.981771 | 2 | 0.3% |
| 74.005109 | 2 | 0.3% |
| 73.929345 | 2 | 0.3% |
| Other values (587) | 599 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 1000 | |
| 3 | 798 | |
| 9 | 658 | |
| - | 620 | |
| . | 620 | |
| 8 | 464 | |
| 4 | 412 | |
| 5 | 336 | 5.5% |
| 6 | 315 | 5.1% |
| 0 | 308 | 5.0% |
| Other values (2) | 605 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4896 | |
| Dash Punctuation | 620 | 10.1% |
| Other Punctuation | 620 | 10.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 1000 | |
| 3 | 798 | |
| 9 | 658 | |
| 8 | 464 | |
| 4 | 412 | |
| 5 | 336 | 6.9% |
| 6 | 315 | 6.4% |
| 0 | 308 | 6.3% |
| 1 | 305 | 6.2% |
| 2 | 300 | 6.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 620 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 620 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6136 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 7 | 1000 | |
| 3 | 798 | |
| 9 | 658 | |
| - | 620 | |
| . | 620 | |
| 8 | 464 | |
| 4 | 412 | |
| 5 | 336 | 5.5% |
| 6 | 315 | 5.1% |
| 0 | 308 | 5.0% |
| Other values (2) | 605 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6136 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7 | 1000 | |
| 3 | 798 | |
| 9 | 658 | |
| - | 620 | |
| . | 620 | |
| 8 | 464 | |
| 4 | 412 | |
| 5 | 336 | 5.5% |
| 6 | 315 | 5.1% |
| 0 | 308 | 5.0% |
| Other values (2) | 605 |
community_board
Text
MISSING 
| Distinct | 59 |
|---|---|
| Distinct (%) | 9.5% |
| Missing | 380 |
| Missing (%) | 38.0% |
| Memory size | 48.3 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1860 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 301 |
|---|---|
| 2nd row | 210 |
| 3rd row | 308 |
| 4th row | 304 |
| 5th row | 104 |
| Value | Count | Frequency (%) |
| 105 | 46 | 7.4% |
| 401 | 24 | 3.9% |
| 101 | 23 | 3.7% |
| 302 | 22 | 3.5% |
| 413 | 21 | 3.4% |
| 412 | 20 | 3.2% |
| 402 | 19 | 3.1% |
| 408 | 18 | 2.9% |
| 104 | 17 | 2.7% |
| 110 | 17 | 2.7% |
| Other values (49) | 393 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 478 | |
| 0 | 459 | |
| 3 | 247 | |
| 4 | 231 | |
| 2 | 181 | 9.7% |
| 5 | 107 | 5.8% |
| 8 | 52 | 2.8% |
| 7 | 41 | 2.2% |
| 6 | 35 | 1.9% |
| 9 | 29 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1860 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 478 | |
| 0 | 459 | |
| 3 | 247 | |
| 4 | 231 | |
| 2 | 181 | 9.7% |
| 5 | 107 | 5.8% |
| 8 | 52 | 2.8% |
| 7 | 41 | 2.2% |
| 6 | 35 | 1.9% |
| 9 | 29 | 1.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1860 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 478 | |
| 0 | 459 | |
| 3 | 247 | |
| 4 | 231 | |
| 2 | 181 | 9.7% |
| 5 | 107 | 5.8% |
| 8 | 52 | 2.8% |
| 7 | 41 | 2.2% |
| 6 | 35 | 1.9% |
| 9 | 29 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1860 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 478 | |
| 0 | 459 | |
| 3 | 247 | |
| 4 | 231 | |
| 2 | 181 | 9.7% |
| 5 | 107 | 5.8% |
| 8 | 52 | 2.8% |
| 7 | 41 | 2.2% |
| 6 | 35 | 1.9% |
| 9 | 29 | 1.6% |
council_district
Text
MISSING 
| Distinct | 51 |
|---|---|
| Distinct (%) | 8.2% |
| Missing | 380 |
| Missing (%) | 38.0% |
| Memory size | 47.6 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.729032258 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1072 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 34 |
|---|---|
| 2nd row | 12 |
| 3rd row | 35 |
| 4th row | 34 |
| 5th row | 3 |
| Value | Count | Frequency (%) |
| 3 | 48 | 7.7% |
| 1 | 32 | 5.2% |
| 26 | 27 | 4.4% |
| 4 | 26 | 4.2% |
| 33 | 21 | 3.4% |
| 9 | 20 | 3.2% |
| 23 | 20 | 3.2% |
| 22 | 19 | 3.1% |
| 35 | 19 | 3.1% |
| 40 | 18 | 2.9% |
| Other values (41) | 370 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 223 | |
| 2 | 191 | |
| 4 | 164 | |
| 1 | 151 | |
| 9 | 69 | 6.4% |
| 5 | 65 | 6.1% |
| 6 | 64 | 6.0% |
| 7 | 55 | 5.1% |
| 0 | 46 | 4.3% |
| 8 | 44 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1072 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 223 | |
| 2 | 191 | |
| 4 | 164 | |
| 1 | 151 | |
| 9 | 69 | 6.4% |
| 5 | 65 | 6.1% |
| 6 | 64 | 6.0% |
| 7 | 55 | 5.1% |
| 0 | 46 | 4.3% |
| 8 | 44 | 4.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1072 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 223 | |
| 2 | 191 | |
| 4 | 164 | |
| 1 | 151 | |
| 9 | 69 | 6.4% |
| 5 | 65 | 6.1% |
| 6 | 64 | 6.0% |
| 7 | 55 | 5.1% |
| 0 | 46 | 4.3% |
| 8 | 44 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1072 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 223 | |
| 2 | 191 | |
| 4 | 164 | |
| 1 | 151 | |
| 9 | 69 | 6.4% |
| 5 | 65 | 6.1% |
| 6 | 64 | 6.0% |
| 7 | 55 | 5.1% |
| 0 | 46 | 4.3% |
| 8 | 44 | 4.1% |
bin
Text
MISSING 
| Distinct | 590 |
|---|---|
| Distinct (%) | 96.2% |
| Missing | 387 |
| Missing (%) | 38.7% |
| Memory size | 50.5 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 4291 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 568 ? |
|---|---|
| Unique (%) | 92.7% |
Sample
| 1st row | 3397699 |
|---|---|
| 2nd row | 2093860 |
| 3rd row | 3027510 |
| 4th row | 3258681 |
| 5th row | 1085961 |
| Value | Count | Frequency (%) |
| 4003539 | 3 | 0.5% |
| 4005198 | 2 | 0.3% |
| 3337638 | 2 | 0.3% |
| 3059161 | 2 | 0.3% |
| 4003108 | 2 | 0.3% |
| 2097441 | 2 | 0.3% |
| 1001409 | 2 | 0.3% |
| 1015690 | 2 | 0.3% |
| 3116055 | 2 | 0.3% |
| 4061587 | 2 | 0.3% |
| Other values (580) | 592 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 737 | |
| 1 | 617 | |
| 3 | 532 | |
| 4 | 502 | |
| 2 | 426 | |
| 5 | 360 | |
| 9 | 292 | 6.8% |
| 7 | 288 | 6.7% |
| 8 | 279 | 6.5% |
| 6 | 258 | 6.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4291 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 737 | |
| 1 | 617 | |
| 3 | 532 | |
| 4 | 502 | |
| 2 | 426 | |
| 5 | 360 | |
| 9 | 292 | 6.8% |
| 7 | 288 | 6.7% |
| 8 | 279 | 6.5% |
| 6 | 258 | 6.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4291 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 737 | |
| 1 | 617 | |
| 3 | 532 | |
| 4 | 502 | |
| 2 | 426 | |
| 5 | 360 | |
| 9 | 292 | 6.8% |
| 7 | 288 | 6.7% |
| 8 | 279 | 6.5% |
| 6 | 258 | 6.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4291 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 737 | |
| 1 | 617 | |
| 3 | 532 | |
| 4 | 502 | |
| 2 | 426 | |
| 5 | 360 | |
| 9 | 292 | 6.8% |
| 7 | 288 | 6.7% |
| 8 | 279 | 6.5% |
| 6 | 258 | 6.0% |
bbl
Text
MISSING 
| Distinct | 589 |
|---|---|
| Distinct (%) | 96.1% |
| Missing | 387 |
| Missing (%) | 38.7% |
| Memory size | 52.3 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 6130 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 566 ? |
|---|---|
| Unique (%) | 92.3% |
Sample
| 1st row | 3023937502 |
|---|---|
| 2nd row | 2051350051 |
| 3rd row | 3011240022 |
| 4th row | 3031580123 |
| 5th row | 1010750047 |
| Value | Count | Frequency (%) |
| 4002810001 | 3 | 0.5% |
| 1008290040 | 2 | 0.3% |
| 1007840074 | 2 | 0.3% |
| 4070750037 | 2 | 0.3% |
| 3011970006 | 2 | 0.3% |
| 3045040107 | 2 | 0.3% |
| 3020230001 | 2 | 0.3% |
| 3030300001 | 2 | 0.3% |
| 5055150016 | 2 | 0.3% |
| 2051350051 | 2 | 0.3% |
| Other values (579) | 592 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2275 | |
| 1 | 776 | 12.7% |
| 3 | 564 | 9.2% |
| 2 | 507 | 8.3% |
| 4 | 494 | 8.1% |
| 5 | 399 | 6.5% |
| 7 | 360 | 5.9% |
| 6 | 266 | 4.3% |
| 8 | 247 | 4.0% |
| 9 | 242 | 3.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6130 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2275 | |
| 1 | 776 | 12.7% |
| 3 | 564 | 9.2% |
| 2 | 507 | 8.3% |
| 4 | 494 | 8.1% |
| 5 | 399 | 6.5% |
| 7 | 360 | 5.9% |
| 6 | 266 | 4.3% |
| 8 | 247 | 4.0% |
| 9 | 242 | 3.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6130 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2275 | |
| 1 | 776 | 12.7% |
| 3 | 564 | 9.2% |
| 2 | 507 | 8.3% |
| 4 | 494 | 8.1% |
| 5 | 399 | 6.5% |
| 7 | 360 | 5.9% |
| 6 | 266 | 4.3% |
| 8 | 247 | 4.0% |
| 9 | 242 | 3.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6130 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2275 | |
| 1 | 776 | 12.7% |
| 3 | 564 | 9.2% |
| 2 | 507 | 8.3% |
| 4 | 494 | 8.1% |
| 5 | 399 | 6.5% |
| 7 | 360 | 5.9% |
| 6 | 266 | 4.3% |
| 8 | 247 | 4.0% |
| 9 | 242 | 3.9% |
MISSING 
| Distinct | 408 |
|---|---|
| Distinct (%) | 65.8% |
| Missing | 380 |
| Missing (%) | 38.0% |
| Memory size | 48.5 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 3 |
| Mean length | 3.251612903 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2016 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 278 ? |
|---|---|
| Unique (%) | 44.8% |
Sample
| 1st row | 551 |
|---|---|
| 2nd row | 30202 |
| 3rd row | 203 |
| 4th row | 427 |
| 5th row | 12902 |
| Value | Count | Frequency (%) |
| 21 | 8 | 1.3% |
| 7 | 7 | 1.1% |
| 109 | 6 | 1.0% |
| 1903 | 5 | 0.8% |
| 76 | 5 | 0.8% |
| 71 | 5 | 0.8% |
| 95 | 5 | 0.8% |
| 1220 | 5 | 0.8% |
| 37 | 5 | 0.8% |
| 29102 | 4 | 0.6% |
| Other values (398) | 565 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 364 | |
| 2 | 280 | |
| 0 | 261 | |
| 3 | 203 | |
| 9 | 173 | |
| 5 | 173 | |
| 4 | 167 | |
| 7 | 145 | 7.2% |
| 8 | 127 | 6.3% |
| 6 | 123 | 6.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2016 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 364 | |
| 2 | 280 | |
| 0 | 261 | |
| 3 | 203 | |
| 9 | 173 | |
| 5 | 173 | |
| 4 | 167 | |
| 7 | 145 | 7.2% |
| 8 | 127 | 6.3% |
| 6 | 123 | 6.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2016 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 364 | |
| 2 | 280 | |
| 0 | 261 | |
| 3 | 203 | |
| 9 | 173 | |
| 5 | 173 | |
| 4 | 167 | |
| 7 | 145 | 7.2% |
| 8 | 127 | 6.3% |
| 6 | 123 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2016 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 364 | |
| 2 | 280 | |
| 0 | 261 | |
| 3 | 203 | |
| 9 | 173 | |
| 5 | 173 | |
| 4 | 167 | |
| 7 | 145 | 7.2% |
| 8 | 127 | 6.3% |
| 6 | 123 | 6.1% |
neighborhood_tabulation_area_nta_2020_
Text
MISSING 
| Distinct | 173 |
|---|---|
| Distinct (%) | 27.9% |
| Missing | 380 |
| Missing (%) | 38.0% |
| Memory size | 50.1 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Characters and Unicode
| Total characters | 3720 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 35 ? |
|---|---|
| Unique (%) | 5.6% |
Sample
| 1st row | BK0102 |
|---|---|
| 2nd row | BX1004 |
| 3rd row | BK0801 |
| 4th row | BK0401 |
| 5th row | MN0402 |
| Value | Count | Frequency (%) |
| mn0502 | 23 | 3.7% |
| mn0501 | 22 | 3.5% |
| mn0101 | 18 | 2.9% |
| mn0401 | 10 | 1.6% |
| mn0201 | 10 | 1.6% |
| qn0202 | 9 | 1.5% |
| mn1001 | 9 | 1.5% |
| bk0301 | 9 | 1.5% |
| qn0201 | 8 | 1.3% |
| qn1303 | 8 | 1.3% |
| Other values (163) | 494 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1069 | |
| 1 | 539 | |
| N | 348 | 9.4% |
| 2 | 318 | 8.5% |
| B | 246 | 6.6% |
| Q | 182 | 4.9% |
| K | 180 | 4.8% |
| 3 | 172 | 4.6% |
| M | 166 | 4.5% |
| 5 | 112 | 3.0% |
| Other values (8) | 388 | 10.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2480 | |
| Uppercase Letter | 1240 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1069 | |
| 1 | 539 | |
| 2 | 318 | 12.8% |
| 3 | 172 | 6.9% |
| 5 | 112 | 4.5% |
| 4 | 101 | 4.1% |
| 7 | 51 | 2.1% |
| 8 | 48 | 1.9% |
| 6 | 43 | 1.7% |
| 9 | 27 | 1.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 348 | |
| B | 246 | |
| Q | 182 | |
| K | 180 | |
| M | 166 | |
| X | 66 | 5.3% |
| S | 26 | 2.1% |
| I | 26 | 2.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2480 | |
| Latin | 1240 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1069 | |
| 1 | 539 | |
| 2 | 318 | 12.8% |
| 3 | 172 | 6.9% |
| 5 | 112 | 4.5% |
| 4 | 101 | 4.1% |
| 7 | 51 | 2.1% |
| 8 | 48 | 1.9% |
| 6 | 43 | 1.7% |
| 9 | 27 | 1.1% |
Latin
| Value | Count | Frequency (%) |
| N | 348 | |
| B | 246 | |
| Q | 182 | |
| K | 180 | |
| M | 166 | |
| X | 66 | 5.3% |
| S | 26 | 2.1% |
| I | 26 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3720 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1069 | |
| 1 | 539 | |
| N | 348 | 9.4% |
| 2 | 318 | 8.5% |
| B | 246 | 6.6% |
| Q | 182 | 4.9% |
| K | 180 | 4.8% |
| 3 | 172 | 4.6% |
| M | 166 | 4.5% |
| 5 | 112 | 3.0% |
| Other values (8) | 388 | 10.4% |